Gene Moth_1697 details


Gene Information

Locus tag: Moth_1697
Symbol:
ID: 3833297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1735263
End bp: 1736498
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 60%
IMG OID: 637829622
Product: hypothetical protein
Protein accession: YP_430542
Protein GI: 83590533
COG category:
COG ID:
TIGRFAM ID:
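
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not state this, but the listed numbers are consistent with it) and that the CDS ends in a single stop codon. The helper name `feature_length` is hypothetical and introduced only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates copied from the record for Moth_1697.
gene_len = feature_length(1735263, 1736498)

# Each codon is 3 bp; the final codon is assumed to be a stop codon,
# which is not counted as an amino acid.
protein_len = gene_len // 3 - 1

print(gene_len, "bp")
print(protein_len, "aa")
```

Both values agree with the Gene Length and Protein Length fields of the record.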


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.600188
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGCG CTGAACGCGG TATAGTCATG TCCAGGGAGG GGCAGAGGGT CATCGTTTTA 
ACGCCCCGGG GCGACTGGCG GGCTTTAAAA TTGGCCGGAC CCCTGCCGGA AGTGGGGGAA
GAAATTATGC TGCCGCCGGT GGTCAAGAGG CCGGCGTGGC CCCTCTTTAC TGCCGCCGCA
GTAGTTCTTT TACTGGTCCT GGCCGGGGCT GTAGTGCGCC GGATAGAAAC CCCGGCGCCG
GCGGCTGCCC CTATGGTAGC CTATTATGTT AATATAGATA TTAACCCCAG CGTAGAGCTG
GCCGTGGATG AAAAGGATAC CGTCCTCGAG GCCCGCGGCC TGAACAACGA CGGGGAAAAA
TTACTCGCCG GCATTGCCCT GAAGGGGGAA AAGGTTACCG GGGCTATGAA GATTCTGGCC
CTGGAGGCCC TGCGCCAGGG TTATTACCTT CCCGAGGGGG AAGGCGCCAT GATGGTAACA
GTAATCCCTG CCGGGTCCGG CCAGGAGAAA CTGGCGGCCG GGGATGAACT GGGCCAGAGG
CTCACCCGCG AAGCGCAGGA TGTCTTTCAG CAGGCCGGGG TACATGCTGC CGTAGAGGCA
GCTACCGTCC AGCCGGAGAT CCGCCAGCAT GCCGAGGCCG CCGGCCTCTC GGCCGGTAAG
TACAGCATTA TGCTGGAGGC CCTGGCAGCC GGGGCCCAGG TAAAGGCTGT CGATTTGCAG
CGGGAGAGTA TTACTAAAGT CCTGCAGGAA CTCAATTTTA ACTGGGAAGA TGTGCTTGCC
CGGCTAAAAA GGGATCCTGA CCTGTTAAAG AGGGAAGAGC AACTGGGACC GGTCCTGAAG
GCGGCCCTGG GTCAGGGCCC ACTCCCCGCG GAAAACGGAA ATAGCCAGGG TAATGCTCCC
GCCAGGGGTC CGGCAGCAGC ACCGGCTAAT AAGCCCGACC AGGGGGATAA TCAGGAAACC
CGGCAAGGCA AACAGGAAAC CACCGGCCAG GGGAGGGAGA TGAACTCGAA CCGTGGCCAG
TCCTCCAGCC GGGATGGTAT GGCAGTAGCC TGGCAGCTAC GAGCCCGGCT CAAGGCCGGA
CTGCAAAACC AGCCGGGGGG ACCGGTCCTC AAAGAACTGC CGGTACTAGA ACATGTCCCC
GGGAAAAACC TGCGGGACGT GCTGGCAAAA ATAAAATTGG AAGACGTGGT TTTAAAGAAA
ATAGAGGAAA AACGGCAGGA TATGGCGAAA AGATAG
 
Protein sequence
MDGAERGIVM SREGQRVIVL TPRGDWRALK LAGPLPEVGE EIMLPPVVKR PAWPLFTAAA 
VVLLLVLAGA VVRRIETPAP AAAPMVAYYV NIDINPSVEL AVDEKDTVLE ARGLNNDGEK
LLAGIALKGE KVTGAMKILA LEALRQGYYL PEGEGAMMVT VIPAGSGQEK LAAGDELGQR
LTREAQDVFQ QAGVHAAVEA ATVQPEIRQH AEAAGLSAGK YSIMLEALAA GAQVKAVDLQ
RESITKVLQE LNFNWEDVLA RLKRDPDLLK REEQLGPVLK AALGQGPLPA ENGNSQGNAP
ARGPAAAPAN KPDQGDNQET RQGKQETTGQ GREMNSNRGQ SSSRDGMAVA WQLRARLKAG
LQNQPGGPVL KELPVLEHVP GKNLRDVLAK IKLEDVVLKK IEEKRQDMAK R
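
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the standard codon assignments (which translation table 11 shares for amino acids; table 11 differs mainly in its permitted start codons), ignores the spacing used in the listing, and stops at the first stop codon. The function name `translate` and the variables `first_row` and `last_row` are illustrative assumptions. Because this gene starts with ATG, no special handling of alternative start codons is included.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping in the listing) is removed first.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence, copied from the record.
# The final row is in frame because 1200 bases (a multiple of 3) precede it.
first_row = "ATGGACGGCG CTGAACGCGG TATAGTCATG"
last_row = "ATAGAGGAAA AACGGCAGGA TATGGCGAAA AGATAG"

print(translate(first_row))
print(translate(last_row))
```

Passing the full gene sequence to the same function gives the complete protein for comparison with the listing above.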