Gene Athe_2212 details

Gene Information

Locus tag: Athe_2212
Symbol:
ID: 7408409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start (bp): 2342585
End (bp): 2343628
Gene length: 1044 bp
Protein length: 347 aa
Translation table: 11
GC content: 30%
IMG OID: 643716580
Product: C4-dicarboxylate transporter/malic acid transport protein
Protein accession: YP_002574059
Protein GI: 222530177
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1275] Tellurite resistance protein and related permeases
TIGRFAM ID: [TIGR00816] C4-dicarboxylate transporter/malic acid transport protein
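
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a few lines of Python. This is only a sketch: it assumes the coordinates are 1-based and inclusive, and that the CDS includes its stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS that ends with one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon yields one amino acid.
    return gene_len_bp // 3 - 1


length = gene_length(2342585, 2343628)
print(length)                  # 1044
print(protein_length(length))  # 347
```

Both values match the gene length and protein length reported in the record.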


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000186594
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAAAGTA TCATCAAAAA CTTTTATCCG TCATGGTTTG TTGTTTGTAT GGGAACTGGA 
ATTATTACAA ATCTTTTCAA AGCTGTTGGC TATAACTTTT TGTCTATTAC TTTTGCCTTG
ATTAACATTG TATTTTTTGC AATAATCTTT TTAATCTGGT TTCTAAGATG GTTTATAGGA
TTTGAAAGTG TAAAAAAGGA TATAGAAAAT CCTCTTTTAT CAAACTATTT TGCTACAATG
CCAATTTCCC TTTTGATTGT AGGATTAAAT ATTTTAATTA ACCAGCAGTT CTTTGGGAAA
ATTTTTTCAG CCGCTTTTGC AAGGTATTCA TTCTATTTAG GAAGTTTTCT TATGATAATA
TTTTCAATAA CAACATTTTT AGTTCATCTT TCTCACAGAG AAATACCAGG TTCACTCTTA
AACTTTGCGT ATTTTATGCC ACCTGTTGGA AATATAATAG TACCTATACT TGGCAATGAA
ATCATAAACA GAAGTGTAGT TGATGGCAAT GAAAAAAGCA TAATAACCTT CATAAATTTG
ACTATGTTTG GAATAGGATT TATGCTATTT TTAAGTTATT TGCCGATTAT CAAAGGAAGG
TTTATTTTAC AAGAACCTAT TGAAAAAGGT CATTTTCCAA CAATGTTTAT TCTTCTTGCA
CCTGTTGGTG CCTCAATTGT TGCTCTAAAG GGCTTTGTGC AGAGCCTCAA ACTTGCAGGG
ATAATTGATA ATGCAGCTTT ACCAGCTTTG ATTAACGTAC TGACAACAAT ACTCTGGGGA
TTTGGATTTT GGATATTAAT GGCATTGGTT GTTTTACTGC TGAAAAATCT AAAAACAAAA
ATTCCATTTA GCCTTGCATA CTGGGCATAT ATTTTCCCTG TTGGTATATT TGTTCTTGCA
ACTTTTAAAG TTAATTCAAG TTACAATTTT TTGTTGTTAA ATCTTTTTGT AAAGTCTTTT
GTCTGGATTT TGTTTGTTGT CTGGGTATAT AACATTTTGC TGACTCTCAA AAATGTATTT
AACAAGAAAC TTTTAGTAAG ATAA
 
Protein sequence
MKSIIKNFYP SWFVVCMGTG IITNLFKAVG YNFLSITFAL INIVFFAIIF LIWFLRWFIG 
FESVKKDIEN PLLSNYFATM PISLLIVGLN ILINQQFFGK IFSAAFARYS FYLGSFLMII
FSITTFLVHL SHREIPGSLL NFAYFMPPVG NIIVPILGNE IINRSVVDGN EKSIITFINL
TMFGIGFMLF LSYLPIIKGR FILQEPIEKG HFPTMFILLA PVGASIVALK GFVQSLKLAG
IIDNAALPAL INVLTTILWG FGFWILMALV VLLLKNLKTK IPFSLAYWAY IFPVGIFVLA
TFKVNSSYNF LLLNLFVKSF VWILFVVWVY NILLTLKNVF NKKLLVR
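
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that grouping, translate the nucleotide sequence, and compute a GC fraction. It is an illustration under stated assumptions, not the tool that produced this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1. Table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same amino acid assignments as the standard code; it differs only in
# which codons may act as start codons (not modelled in this sketch).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# First and last lines of the gene sequence shown above.
first_line = "ATGAAAAGTA TCATCAAAAA CTTTTATCCG TCATGGTTTG TTGTTTGTAT GGGAACTGGA"
last_line = "AACAAGAAAC TTTTAGTAAG ATAA"
print(translate(first_line))  # MKSIIKNFYPSWFVVCMGTG
print(translate(last_line))   # NKKLLVR
```

The two printed fragments agree with the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.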