Gene Moth_2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2097 
Symbol 
ID3832463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2187480 
End bp2189390 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content65% 
IMG OID637830022 
ProductABC transporter related 
Protein accessionYP_430932 
Protein GI83590923 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4988] ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components 
TIGRFAM ID[TIGR02857] thiol reductant ABC exporter, CydD subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGCTTA ATCATGACCT GTTACGTGAG GCCGGCAGGG TACGCCGCCA GCTGGCCCTT 
ACGGTGGAGC TGGGGTTGGG GGCCGGCCTC CTGGCCATTC TCCAGGCCTG GTTCCTGGCC
CGGGTGGTCA ACGGTGTCTT CCTGGAAGGG CGGGATCTGC CGGGGGTCTG GCCGTGGCTC
TTAATCCTTC TGGGGCTCAT TTTTCTGCGG GCGGCTTTCG CCTGGGGGAT GGAAGTGGCC
GGCCACCGGA CCGCGGCCCG GATCAAGTAT GACCTGCGTC GCCGCCTGGT GGCCCACCTC
CTGGCCCTGG GGCCGGTACC TTTAAAGGAT GGGCATACCG GGGAACTGGT TAACGTCCTG
GTTGAGGGGG TTGAGGACCT GGAGACCTAT TTTGCCGGCT ACCTGCCCCG CCTGGCCCTG
GCGGCCCTGA TGCCCATGGC TGTCCTGGGT TTTGTTTTTC CCCTGGACCT TTTCTCCGGC
CTGTTCCTCC TTGGCACCGC CCCCCTGCTC CCCCTCTTTA TGTTCCTTAT CGGCGGGCAG
GCGGAAAGGC TAACCAGCAA CCAATGGGAG ACCCTGGGCC GCCTGAGCGG CCACTTCCTG
GATGTGCTAC AAGGACTTAC CACCCTGAAA ATTTTCGGCC GCAGCAAGGC CCAGGCCGAG
GTCCTGGCCC GCCTCAGTGA CCGTTTCCGG TCCACCACCC TGGGAGTGCT ACGGGTGGCC
TTTCTCTCGG CCCTGGTCCT GGAGCTGGCG GCCACCTTAG GCACTGCCCT GGTGGCTGTT
TCTGTTGGCC TGCGCCTCCT CTACGGCCGC TTGCCCTTTC AGGAAGCCCT CTTCCTTTTA
TTGCTGGCGC CGGAATTCTA CCTGCCCCTG CGCCTCCTGG GCAGTCGGTA CCATGCCGGC
CTGGCCGGCG TTACCGCAGC CGCCCGTATC TTTGACCTCC TCTCCCGGCC CCTTCCCGCC
GGTGAAGGCG GTAAAGCAGG TAGCGCCGTC GGCCAGGATG AGGTGGTTGC GCCCTTCATA
GCGGGCCATC CAGGCAGGGC GACAGAGCTA TCGGAGGATA AAAAAATGGT GGCTGGCATT
TGTCTGAAGC ACGGGATCCG ATCCGACGGG GAAAGCATCC CTGGCGCCGC CGGAGATGGA
GGGCGCCCGG CTGGAGAAAA AGGCCTGCTC CGGCCCGGCC TTCATATTAT CCTGGAAGAT
GTTTATTATG CCTACGATCC GGGGAACCGG CCCGCCCTGC AGGGCCTTTC CCTGGAACTC
CGCCCCGGGG AAAAAGTGGC CCTGGTCGGA CCCAGTGGCA GCGGTAAAAG CACCGTCGCC
CACCTGCTCT TGCGCTTCCT GGAGCCCGAT CGCGGGCGCA TAACGGCCGA CGGCCATCCT
TTAGACCGGG TTCCCCCGGA AGATTGGCGC CGCCAGGTAT CCCTGGTGCC CCAGCATCCC
TACCTTTTCA GCGGCACTAT AGCCGACAAT ATCCTCCTGG GACGCCCGCA TGCCTCGTGG
GAGGAAATGG TGACGGCGGC CAGCCTGGCC GGCGCCCACG AGTTCATCAA CGCCCTGCCG
CAAGGTTACG CTACCCCCAT AGGCGAAAGG GGCTTGCGTT TGAGCGGCGG CCAGGCCCGG
CGCCTGGCCA TCGCCCGCGC CTTCCTGAAG GAGGCGCCCT TACTGATTCT CGATGAAGCT
ACCGCCGGCC TGGACCCGGC CACCGATCAG ATCATCCAGA CCGCCCTGGA GCGCCTCCTT
CGCGGGCGCA CGTCCCTGAT CATCGCCCAT CGTCTGAGCG CCGCCGTCCG GGCCGACCGC
ATCGTTGTCC TGGACTCCGG CAGGGTAATA GAGGAAGGGC GGCATGAAGA GCTCCTGGCC
CGCCGGGGTC TGTATTATCG CCTGGTTACG GCCTCCAGGG GGGCGGCGTA A
 
Protein sequence
MLLNHDLLRE AGRVRRQLAL TVELGLGAGL LAILQAWFLA RVVNGVFLEG RDLPGVWPWL 
LILLGLIFLR AAFAWGMEVA GHRTAARIKY DLRRRLVAHL LALGPVPLKD GHTGELVNVL
VEGVEDLETY FAGYLPRLAL AALMPMAVLG FVFPLDLFSG LFLLGTAPLL PLFMFLIGGQ
AERLTSNQWE TLGRLSGHFL DVLQGLTTLK IFGRSKAQAE VLARLSDRFR STTLGVLRVA
FLSALVLELA ATLGTALVAV SVGLRLLYGR LPFQEALFLL LLAPEFYLPL RLLGSRYHAG
LAGVTAAARI FDLLSRPLPA GEGGKAGSAV GQDEVVAPFI AGHPGRATEL SEDKKMVAGI
CLKHGIRSDG ESIPGAAGDG GRPAGEKGLL RPGLHIILED VYYAYDPGNR PALQGLSLEL
RPGEKVALVG PSGSGKSTVA HLLLRFLEPD RGRITADGHP LDRVPPEDWR RQVSLVPQHP
YLFSGTIADN ILLGRPHASW EEMVTAASLA GAHEFINALP QGYATPIGER GLRLSGGQAR
RLAIARAFLK EAPLLILDEA TAGLDPATDQ IIQTALERLL RGRTSLIIAH RLSAAVRADR
IVVLDSGRVI EEGRHEELLA RRGLYYRLVT ASRGAA