Gene Moth_2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2098 
Symbol 
ID3832464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2189551 
End bp2191380 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content65% 
IMG OID637830023 
ProductABC transporter related 
Protein accessionYP_430933 
Protein GI83590924 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4987] ABC-type transport system involved in cytochrome bd biosynthesis, fused ATPase and permease components 
TIGRFAM ID[TIGR02868] thiol reductant ABC exporter, CydC subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACCCC CGCCTCCCTC TGCTTCTCCC GCCGCCAGGG AGAATACAGA TCCCGGGGGT 
CACCGAGCAG ACTCTGGGGT TTCATATTCA AAGATTGACC CGGTCGGCAC CTTCGTCCGC
TTGCTGGGTT TAATAGCACC GGCCTGGCAG GCCGTCCTCG GCGCCACCCT TTTGGGTTCC
GGTACCATCG CCAGTAACAT TGGCCTCATG GCCACGGCTG CCTTCCTTAT CGCCAGCGCT
GCCCTCCACC CGCCGGCCGG AAAACTAATG CTGGCCATAG TCGGGGTGCG CTTTTTCGGC
ATCACCCGGG CCGTATGTCG CTACCTGGAA CGTTATGTAA ACCATAGTAT AACCCTTGGC
ATCCTGGGCC GGTTGCGGGT TGCCTTCTAC CGGACCCTGG AACCCCTGAT CCCGGCCGGT
TTGCAGGGCC ATCACAGCGG GGATTTGCTT AGCCGCGCTG TAGCCGACGT TGCCACCCTG
GAGAATTTTT ACCTTCGCGT CCTGAACCCA CCCCTGGTCG CCCTGCCGGT CGCCGCCGGG
GTTTTCCTGT TTTTGGCCCA TTTCGGCCGG ACCCTGGCCC TGGCCTGGAT GGGCGCTTTC
CTGGCCGCCG GGGTTATCTT CCCCGTGGGC GTCACAATTG TCGGCCGGGG CGTCATGCGG
CGCCAGGGCG AGGCCCGGGC GGCCCTGAAT ACCGCCCTGG TGGACACCGT CCAGGGCCTG
GCCGACATCC TGGTCTTCGA TCATGGGCGG CAACAACAGG AGTACATCGC CACCCTGGAC
CGTCAGTACC TGCATCTCCA GGGCCGTAAA GCCGGCCTGA ACGGTTTGGC GAACGCCCTC
ACCAGCCTGG CAAGCAACCT GGCCCTGTGG GCCGTCCTGG TACTGGCCAT CCCCCTGGTA
AACAGGGGAC AAATTGACGG CGTCTACCTG GCCATGCTGG CCCTGACGGC CGCCGCCGCC
CTGGAAGCCG CCAAGCCCCT GCCCATGCTC TTCCCCCACC TGGAAGGGAG CCTGGCCGCC
GCCCGCCGTA TCTTCGCCCT TAGCGACACC CGACCCGCCG GGGACCCGGC CGGCCCCGTT
CCCCACCCCC GGGACTTTTC CCTCCGGGTC CAGGGACTGC GTTTCCGCTA CGGCCCCGGG
GAACCACCGG CCCTGGACGG CATCGATTTT GACGTCCCCT CCGGAGCGCG GATAGCCATC
GTCGGCCCCA GCGGCGCGGG CAAAAGTACC CTGGTTAATT TGCTCCTGCG CTTCTGGGAC
TATGAAGAAG GAGCCATACT CCTGGGTGGC TACGACCTGA AGGCCTATCC ACCGGAGGAG
CTACAGCGTT TCATCGGAGT TGTGGCCCAG CCAACCCATC TCTTTCACGC CACCATCGCC
GAAAACCTGC TCCTGGCCCG ACCGGACGCG ACCCGGGAGG AGATGGAGCG GGCGGCCCGG
GAAGCCCGGC TGCATGAGTT TATCCAGGTC CTGCCCCGGG GCTACGACAC CCTGATCGGC
GAAGAAGGCT TTAAGCTCTC CGGCGGCCAG CGCCAGCGAC TGGCCATAGC CCGGGCCTTG
CTGCAAAACG CCCCCATCCT CATCCTCGAT GAGGCTACGA CCGGCCTGGA TGCCGTAACA
GAACGAGAGG TAATGGATTC CATCCGCCAC CTGATGGAGG GGCGCACCAC CCTGGTCATC
ACCCACCGCC TGGTGGGCCT GGAAGACATG GATAAAATCC TGGTCCTTGA CAGGGGCAGG
TTGGTCCAGC AGGGGCGGCA TGCAGAACTT CTCCGGCAGG AGGGCCTTTA CCGCCATTTG
TGGCAACTGC AGCAGGAGGC GCTACCCTAA
 
Protein sequence
MLPPPPSASP AARENTDPGG HRADSGVSYS KIDPVGTFVR LLGLIAPAWQ AVLGATLLGS 
GTIASNIGLM ATAAFLIASA ALHPPAGKLM LAIVGVRFFG ITRAVCRYLE RYVNHSITLG
ILGRLRVAFY RTLEPLIPAG LQGHHSGDLL SRAVADVATL ENFYLRVLNP PLVALPVAAG
VFLFLAHFGR TLALAWMGAF LAAGVIFPVG VTIVGRGVMR RQGEARAALN TALVDTVQGL
ADILVFDHGR QQQEYIATLD RQYLHLQGRK AGLNGLANAL TSLASNLALW AVLVLAIPLV
NRGQIDGVYL AMLALTAAAA LEAAKPLPML FPHLEGSLAA ARRIFALSDT RPAGDPAGPV
PHPRDFSLRV QGLRFRYGPG EPPALDGIDF DVPSGARIAI VGPSGAGKST LVNLLLRFWD
YEEGAILLGG YDLKAYPPEE LQRFIGVVAQ PTHLFHATIA ENLLLARPDA TREEMERAAR
EARLHEFIQV LPRGYDTLIG EEGFKLSGGQ RQRLAIARAL LQNAPILILD EATTGLDAVT
EREVMDSIRH LMEGRTTLVI THRLVGLEDM DKILVLDRGR LVQQGRHAEL LRQEGLYRHL
WQLQQEALP