Gene Franean1_5074 details


Gene Information

Locus tag: Franean1_5074
Symbol:
ID: 5673409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6074075
End bp: 6075211
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 74%
IMG OID: 641243925
Product: bifunctional RNase H/acid phosphatase
Protein accession: YP_001509339
Protein GI: 158316831
COG category: [G] Carbohydrate transport and metabolism; [L] Replication, recombination and repair
COG ID: [COG0328] Ribonuclease HI; [COG0406] Fructose-2,6-bisphosphatase
TIGRFAM ID:
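
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 6074075, 6075211

gene_length = span_length(start_bp, end_bp)
print(gene_length)                           # 1137
print(expected_protein_length(gene_length))  # 378
```

Under these assumptions both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields.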


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0171562
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0167384
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCTG GGCGCAGACT GGTCGTCGAG GCCGACGGAG GATCCCGCGG CAATCCCGGG 
CCCGCCGGGT ACGGCGCCCT CGTCCGGGAC GCCGGCACCG GCCAGGTGCT GGCCGAGCGG
GCCGCCTCGA TCGGCACGGC CACCAACAAC GTCGCCGAGT ACGAGGGCCT GCTCGCCGGC
CTGCGGGCCG CCGCGGAGCT CGACCCGGGC GCCGACATCG AGGTCCGGAT GGACTCCAAG
CTCGTCGTCG AGCAGATGAG CGGCCGTTGG AAGATCAAGC ACCCGTCGAT GCGCCCGCTG
GCGGCCCAGG CCGCCGAGAT CGCCGCTGGG CTCGGGCGGG TCCGGTTCGT CTGGGTGCCC
CGGGCCCGCA ACGGCGACGC CGACCGGCTG GCGAACGAGG CGATGGACGC CGCCGCCGCC
GGCCGGTCCT GGGAACCCTC CGTCCCGCAG TCGCCCGACC CGCTGCCGCA CCAGGCGCCC
ACCACGAACC GGCTCTCCGG CTGGATGGCC CCGCCCGCTC CGCCGACCAC GACGGTCCTG
CTCCGTCACG GCCAGACGCC GCTGTCGGTG GAGAAGCGGT TCAGCGGAAC CGTGGAGGCC
TCACTCACCG ACCTCGGGCT GGAGCAGGCC GCCGCCGCTG CGGACCGGCT GCGCGGCGAG
CCGTTCGACG CCGTGATCAG CTCGCCGCTC AAGCGCGCCC GCCAGACCGC GGACGCGCTG
GGCCGCGACT ACCTGATCGA CGAGGACCTG CGGGAGACCG ACTTCGGCGC GTGGGAGGGG
ATGACGTTCG CCGAGGTCCG CGAGCGCTTC CCGGACGAGC TGAACGCCTG GCTCGCCGAC
CCGGCGGTGC CGCCGCCGGG CGGGGAGAGC CTGCTGAGCA CGGGCGCCCG TGTCGCGGCG
GCCCGCGACC GGATCATGGC CCAGTATCCC GCCGGGCGCG TCCTCGTGGT CTCGCACGTG
ACCCCGATCA AGGGGCTCAC CCAGCTCGCG CTCGCCGCCG AGCCGACCGT GCTCTACCGG
CTGCACCTGG ACCTGGTGTC GATCACCACC GTCGACTGGT ACTCCGACGG ACCCGCGGTG
CTGCGGGGCT TCAACGACAC CCACCACGTC GCCCATCTGG TCATTCCCGG CGAGTAG
 
Protein sequence
MSAGRRLVVE ADGGSRGNPG PAGYGALVRD AGTGQVLAER AASIGTATNN VAEYEGLLAG 
LRAAAELDPG ADIEVRMDSK LVVEQMSGRW KIKHPSMRPL AAQAAEIAAG LGRVRFVWVP
RARNGDADRL ANEAMDAAAA GRSWEPSVPQ SPDPLPHQAP TTNRLSGWMA PPAPPTTTVL
LRHGQTPLSV EKRFSGTVEA SLTDLGLEQA AAAADRLRGE PFDAVISSPL KRARQTADAL
GRDYLIDEDL RETDFGAWEG MTFAEVRERF PDELNAWLAD PAVPPPGGES LLSTGARVAA
ARDRIMAQYP AGRVLVVSHV TPIKGLTQLA LAAEPTVLYR LHLDLVSITT VDWYSDGPAV
LRGFNDTHHV AHLVIPGE
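
The sequences above are printed in space-separated blocks of ten. A minimal sketch of how such blocks might be joined, checked for GC content, and translated is given below. It uses only the Python standard library, and the helper names (clean, gc_content, translate) are illustrative rather than part of any database tooling. The amino-acid assignments of translation table 11 are the same as the standard code (the table differs in which start codons it permits), so a plain standard-code lookup is assumed to be sufficient for this ATG-initiated gene.

```python
from itertools import product

# Standard-code amino acids in TCAG codon order; translation table 11
# uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, dropping one trailing stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the gene sequence above.
first_line = "ATGAGCGCTG GGCGCAGACT GGTCGTCGAG GCCGACGGAG GATCCCGCGG CAATCCCGGG"
last_line = "CTGCGGGGCT TCAACGACAC CCACCACGTC GCCCATCTGG TCATTCCCGG CGAGTAG"

print(translate(first_line))  # MSAGRRLVVEADGGSRGNPG
print(translate(last_line))   # LRGFNDTHHVAHLVIPGE
```

The two printed fragments correspond to the beginning and end of the protein sequence listed above. Passing the full gene sequence to gc_content would allow the reported GC content to be compared in the same way.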