Gene Moth_1685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1685 
Symbol 
ID3833285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1723787 
End bp1724701 
Gene Length915 bp 
Protein Length304 aa 
Translation table11 
GC content49% 
IMG OID637829610 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_430530 
Protein GI83590521 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00261059 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.299717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCCAGA AATCCAAACG AGTGGGGAAG CTGATCGCGC TATTTACAGG CATCGCTATG 
TTGCTATTTG TAGCTGCCGG CTGCAGTGGA ACCAGGGCTA AAGGCACAGT GGTTGTAGGG
TCCAAGGACT TTACCGAAAA CATTCTCCTT GGCGAGATAA TGGCCCAGCT CATAGAAGCC
CATACGGACC TGAAGGTGGA ACGCAAATTG AACTTGGGCG GTACATTGGT TAACTTTAAC
GCCCTTAAAA AAGGCGACCT TGATCTCTAC GCTGACTACA CCGGTACCGG CCTAGTGGCA
ATCTTAAAAA GGGATGTTAT CAATGACCCC CAGGAGGCTT ACGATGCAGT TCAAAAGGCA
TACAACGAGC AGTTTAAGCT AAAGTGGCTG AAACCCTTTG GCTTTAATAA CACCTACGCC
CTTGCGGTAC CGGAGGAGGT TGCTCGACAG CGTAACTTAC AAAAAATATC CGACCTGAAA
AGCGTAGCCG GTGAGATGGT ACTCGGGGCC GAGCAGGAAT TTTTTAACCG CCCGGACGGC
TATGACGGCT TAATTGTCAC TTACGGGCTA AATTTCAAAA GCACCAAGCA GATGGAAACC
GGCTTAAAAT ACGAAGCCAT TCATAACAAG ATGGTAGATG TGATCGACGC CTTCGCCACC
GACGGCCAGT TGATTACCTA TAAGCTAAAG ATCCTGGAAG ATGATAAACA ATTCTTCCCG
CCCTACTTTG CTGCACCGTT GGTACGTATG GACACCCTCG AGAAGTATCC CCAGCTGGAA
GAAGTCCTGA ACAAGCTGGC GGGCCAGCTC AATGATGATG AGATGCGTCA GCTGAATTAT
CAGGTCGACG AGGAAAAAAA GGAAGTGGCC CAGGTGGCAA GAGATTTCCT GCTGAAAAAA
GGCCTGATCA AGTAA
 
Protein sequence
MRQKSKRVGK LIALFTGIAM LLFVAAGCSG TRAKGTVVVG SKDFTENILL GEIMAQLIEA 
HTDLKVERKL NLGGTLVNFN ALKKGDLDLY ADYTGTGLVA ILKRDVINDP QEAYDAVQKA
YNEQFKLKWL KPFGFNNTYA LAVPEEVARQ RNLQKISDLK SVAGEMVLGA EQEFFNRPDG
YDGLIVTYGL NFKSTKQMET GLKYEAIHNK MVDVIDAFAT DGQLITYKLK ILEDDKQFFP
PYFAAPLVRM DTLEKYPQLE EVLNKLAGQL NDDEMRQLNY QVDEEKKEVA QVARDFLLKK
GLIK