Gene Franean1_5203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5203 
Symbol 
ID5673537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6246799 
End bp6247836 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content73% 
IMG OID641244057 
ProductdTDP-4-dehydrorhamnose reductase 
Protein accessionYP_001509467 
Protein GI158316959 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1091] dTDP-4-dehydrorhamnose reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGTAC TGGTCCTCGG CGGCGACGGC ATGCTCGGCG GTGAGCTGGT CCGCCGGCTG 
GTCCGCGACC ATGACGTCAC CGCCACCGTC AGGGCCACCG CACCGTCCTG CCCGCCACCC
GCCGACCGGG TGCTCAGCGG CGTGGACGTC CGCCGCCGCG ACACCATGGT CGACGCGTTC
GCCGCGGTGC GGCCGGACGC CGTCGTGAAC GCCGTCAGCC TGGTCGGGCG GCGCGCGGAG
GGACGTGCCG AGCTGTCGGC GATCGAGGTC AACGCCCTTT TCCCGCATCG CCTCGCGCGC
CTGTGCCAGG CCGCCGGGGC GCGACTGGTG CACGTATCGA CCGACTGCGT GTTCTCCGGC
CGCCTGGGTG ACTACCACGA GGAGGATGTG CCCGACCCGG TGGACGTCCA CGGGATGACC
AAGCTGCTCG GCGAGGTGAC CGAGCCGGGC ACGCTCACGC TGCGGACGTC CGTCGTCGGC
CTGGAGGCGG TACCGGCGGC CTCGGGGCTC GTCGAGGGGT TCCTGGCGGC GAAGGGTGAG
ATCCCGGCCT CGCGCCGGGT CGTCCACAGC GCTCTGACCA CCGCCGAGTT CGCGCGCTTC
GTCCACCTCG TGCTCGTGGG GCACCCGGAC CTGACCGGGA TCTGGCATCT CGCGTCCGAG
CCGATCAGCC GGTTCGACCT GCTCACCATG CTCGCCGACC GGCTCGGCCG GCGGGACGTC
AAGATCGTCC CGAGCGACGG CGAGGCCCGC AACCGGGCGC TGTCCGCGCG CCGGCTGTGG
TCGGAGACCG GCTACCTGCC GCCCGGGTGG CCGGCGATGG TCGACGAGCT GGCGACCGCG
ATCGAACGGC GTGACATCGA AGGCTCCTCC CGGCGTCTCC CGGCACCTCG TCCCGGCGGA
CCGGAGGCTC CCGCGCCGCC CTCACCTCCC CGAACGGAGT GCACGCCGAT GCCTGACAGG
AACCTCGACG AACACGGCTC CGGGAACCGG CCGCCGCATC AGCCCTGGTC GGCCGAGCCG
GTCGATCCGC TCCGGTAG
 
Protein sequence
MRVLVLGGDG MLGGELVRRL VRDHDVTATV RATAPSCPPP ADRVLSGVDV RRRDTMVDAF 
AAVRPDAVVN AVSLVGRRAE GRAELSAIEV NALFPHRLAR LCQAAGARLV HVSTDCVFSG
RLGDYHEEDV PDPVDVHGMT KLLGEVTEPG TLTLRTSVVG LEAVPAASGL VEGFLAAKGE
IPASRRVVHS ALTTAEFARF VHLVLVGHPD LTGIWHLASE PISRFDLLTM LADRLGRRDV
KIVPSDGEAR NRALSARRLW SETGYLPPGW PAMVDELATA IERRDIEGSS RRLPAPRPGG
PEAPAPPSPP RTECTPMPDR NLDEHGSGNR PPHQPWSAEP VDPLR