Gene Moth_1093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1093 
Symbol 
ID3833059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1121295 
End bp1122392 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content63% 
IMG OID637829021 
Productcobalamin (vitamin B12) biosynthesis CbiG protein 
Protein accessionYP_429950 
Protein GI83589941 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2073] Cobalamin biosynthesis protein CbiG 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGGG AAAAGAGGCT GGCCATTGTA GCCCTTACCC GGCCGGGGCT GCAAACAGCC 
CTGCGCCTGG CTGGTAGCCT GCCCGAAGGG ACCCCGGTTT TTGCCCCGGC CAGCCTGGCC
GCGGGTGGCG CCGAAACGGA TGGGGAAATA AGGGTTGATT TTTATTCCGG CGGGTTACCA
GATTTCCTGG GGGAGATTTT TCACCGGTAC CGGGGCTTGA TCCTGATCAT GGCTGCCGGT
ATTGCCGTCC GGGCGTTGCG TACCCACATG GTATCCAAGT TGACCGACCC GGCTGTTGTA
GTAGTTGATG CTGCTGGTAA ATATGCTATC AGCCTGCTCT CCGGCCACCT TGGGGGTGCC
AACGAGCTGG CCCGCCGGGT GGCCGCCATC CTGGGAGGAG AGGCGGTGAT TACCACGGCC
AGCGAAAGCC GGGGCCTCCC GGCCCTGGAC CTGGTTGCCC GGCGCCTGGA AATGACTATC
TGGCCCCGAG ACAATATGAC AATGGTGATG GCCGCCCTGG TGAATGGTGA AGCTATTGAC
CTGCTGGTGG AACCTCCCTT GCTGGCACGC CTGCAAGGCG AACTCCCGGA CTTGGGAGCC
CGCCCGCTGG AGGGGTACTC CGGGGTCAGG GGTGAGGGGG CCGGAATTAT GGTCACCTGG
CGGCGACTGC CTCTGCCCGG ACCGCGCTGG GTTTACTGGC GGCCGCGGGT CATAGTCGCT
GGCGTTGGTT GCCGCCGGGG CGTACCTGCT GGCACCATTC TCTATGCCCT GGGGGTGGCC
TTGAAAAGGG CCGGTATCAG CCGCCAGAGC CTGCGATCAC TGGCCAGTGT GGATTTCAAG
GCCCGGGAGC CTGGCCTTGA GCTGGCGGCC CGGCAGCTGG GGTTGGAGTT ACGTACCTTT
CCACCGGATG AACTAGCCGC CTGCCTGGAA AGACACCCGG AACTGTCCCG TTCCCAAACT
GTAGCTGCCA GGGTGGGTTT AACCGGAGTA TGCGAGCCAG CGGCCGTGCT GGCAGGAGGA
GATGGTGAAT TATTATGGCC CAAAATAAAA TGCCGGGGGG TAACCATCGC CTTGGCCCGG
GTTCAAGGGG CGAAATAG
 
Protein sequence
MAREKRLAIV ALTRPGLQTA LRLAGSLPEG TPVFAPASLA AGGAETDGEI RVDFYSGGLP 
DFLGEIFHRY RGLILIMAAG IAVRALRTHM VSKLTDPAVV VVDAAGKYAI SLLSGHLGGA
NELARRVAAI LGGEAVITTA SESRGLPALD LVARRLEMTI WPRDNMTMVM AALVNGEAID
LLVEPPLLAR LQGELPDLGA RPLEGYSGVR GEGAGIMVTW RRLPLPGPRW VYWRPRVIVA
GVGCRRGVPA GTILYALGVA LKRAGISRQS LRSLASVDFK AREPGLELAA RQLGLELRTF
PPDELAACLE RHPELSRSQT VAARVGLTGV CEPAAVLAGG DGELLWPKIK CRGVTIALAR
VQGAK