Gene Franean1_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0121 
Symbol 
ID5668546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp145039 
End bp146910 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content72% 
IMG OID641239049 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001504494 
Protein GI158311986 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.412208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGTCC GTCACGCTCC GTTCCCGGTG TCCCGGCGCA CCTGCATGCC TGGCGGCATG 
ATGGGGCACA TGGGAGGCGA CGAGACCGAG CGGTCGGGCG CGGACATCGA GCGTCGGGAC
GACTGCGACG TTCTGGTGAT CGGCAGCGGG TTCGGGGGCA GCGTGACCGC GCTCCGCCTG
GCCGAGAAGG GCTACCGCGT CACGGTCGTC GAGGCCGGCC GCCGGTTCAC CCCGGCGACA
CTGCCGAAGA CGAGCTGGGA CATCCGCAGG TACCTGTGGC TTCCGCGGCT CGGGTGCCAC
GGGATCCAGA AGATCACCCT GCTCGGGCGG GTGCTGGTGC TCTCCGGAGC AGCGGTCGGC
GGTGGATCGG TCGTCTACGC GAACACGCTG TACAGGCCCC TGGACGGTTT CTACGACGAT
CCCCAGTGGG CGGACATCGC CGACTGGCGT GCCGAGCTCG GGCCGTTCTA CGGCCAGGCC
GAGCGCATGC TCGGCTCCAC CCGGAACCCG TCGATGACCA TGGCGGACGA GGTCTTCCGG
GACGTCGCAC GGGATATGGG TGTCGGGGAC ACCTTCCGCC TGGCCGACGT CGGCGTGTTC
TTCGGCCGCG ACGGCCAGCG CGAGCCGGGG GTGACGGTTC CCGATCCCTA CTTCGGCGGC
GCCGGCCCCG CCCGGACGGG CTGCGTCGAG TGCGGCGAGT GCATGACGGG ATGCCGGCGC
GGCGCGAAGA ACACCCTGGA CCGCAACTAC CTCCACCTGG CCGAAGGGCT GGGCGCCCGC
ATCGTGGCGG ACACGACCGT GCGCTCGGTG CGCCCCGACG GGCGCGGTGG CTACGAGGTC
GAGACGGTCC GCACCGGCAG CCGGCGGCGG GACGCGCGGC GCTGGACGGC CCGCCAGGTC
GTGTTCGCCG CGGGCGCCCT GGGAACCCAG CGCCTCCTGC TCGGCATGCG CGAACAGGGG
CACCTGCCCG GCATCTCCGA CCGGGTCGGC CACCTGACCA GGACGAACTC CGAGGCGATC
CTGGGCGCGA CCCGGCTGCG GCCGGATGGG CGGATCACCC GGGGGGTGGC CATAACGTCG
TCCTTCTACC CGAACGATCA CACGCACGTG GAACCTGTCC GGTACGGCCG GGGCAGCAAC
CTCATGGCGT TCCTGTCGAG CGCCATGACC GACGGCGGAG GCTCCCTGCC ACGCTGGGCG
AAGCACCTGC TGCTGCTGGC CCGCCGCCCG TACCTCACGC TGCTCGCCCT GCCGTGGCGG
TGGTCGGAGC GAACCATGAT CGCTCTCGTC ATGCAGTCCC GGGACAACTC GCTCACCGTC
AGCCGGCGCC GCGGGCCGTT CGGGACGTCC TGGCTGACGA GCCACGCTGG CCACGGGGAG
TCCAACCCGA CCTGGATACC GGAGGGCAAC CAGGCGGCGC GGCTGGTGGC GGAGCGGCTG
GGCGGGCATC CGGGAGGCGC GATCACCGAG CTGGCCGACA TCCCGCTGAC GGCGCACATC
CTCGGCGGGG CGGTCCTGGC CGCTGATCCC GCCCGGGGCG TGATCGACCC GTATCACCGG
GTTTTCGGCC ATCCCGGCCT GCACGTTGTG GACGGCTCGG CGGTGCCGGC GAACCTCGGG
GTGAACCCCT CGCTGACGAT CACGGCGATG GCGGAGCGGG CGATGGCCTT CTGGCCGAAC
CGGGGTGACA CCGACCCCCG CCCGCCGGTG GGCGCGGCGT ACCGGCGGAT CGCGCCGGTG
GCACCGCGGA ACCCGGCGGT GCCCGCGGAT GCACCGGGAC ATTATGAAAT AAGTGCCCTG
ACCAGCACGA ATACTGATCT TCCCCGGGAT GCTGTAGCTG GCGGTGGCCG GGAGACTCCT
CACAATATGT GA
 
Protein sequence
MVVRHAPFPV SRRTCMPGGM MGHMGGDETE RSGADIERRD DCDVLVIGSG FGGSVTALRL 
AEKGYRVTVV EAGRRFTPAT LPKTSWDIRR YLWLPRLGCH GIQKITLLGR VLVLSGAAVG
GGSVVYANTL YRPLDGFYDD PQWADIADWR AELGPFYGQA ERMLGSTRNP SMTMADEVFR
DVARDMGVGD TFRLADVGVF FGRDGQREPG VTVPDPYFGG AGPARTGCVE CGECMTGCRR
GAKNTLDRNY LHLAEGLGAR IVADTTVRSV RPDGRGGYEV ETVRTGSRRR DARRWTARQV
VFAAGALGTQ RLLLGMREQG HLPGISDRVG HLTRTNSEAI LGATRLRPDG RITRGVAITS
SFYPNDHTHV EPVRYGRGSN LMAFLSSAMT DGGGSLPRWA KHLLLLARRP YLTLLALPWR
WSERTMIALV MQSRDNSLTV SRRRGPFGTS WLTSHAGHGE SNPTWIPEGN QAARLVAERL
GGHPGGAITE LADIPLTAHI LGGAVLAADP ARGVIDPYHR VFGHPGLHVV DGSAVPANLG
VNPSLTITAM AERAMAFWPN RGDTDPRPPV GAAYRRIAPV APRNPAVPAD APGHYEISAL
TSTNTDLPRD AVAGGGRETP HNM