Gene Athe_1916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1916 
Symbol 
ID7407329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2021362 
End bp2022339 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content40% 
IMG OID643716288 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002573777 
Protein GI222529895 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00471699 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAGCG TATCAAAAGA ACTTTTTGTA CCAGTTCCGA AAGAAGAGAG GCAGCAAGAG 
ACAATTGTCA GGCCAAGCAT GAGCTACTGG CAGGATGCAT GGAGAAGACT TAAGGCTAAT
AAAGTAGCAA TGGCATCAAT GTGGACAATA GTGTTTTTTA TTTTGCTTGC CATAATTGGT
CCAATAGTTA TGCCATACAA ATATGATCAG CAGATTAGAG GGCATGAAGC ACTGCCACCG
TCACTTACTC ATTTATTTGG AACTGATGAG CTTGGTAGAG ATTTGTTTGT AAGATGCTTG
TATGGTATGA GAATCTCTCT TTCCATAGGA ATTGTTGCAA CAATTATAAA TATTGTGATT
GGTGTTTTAT ATGGGGGCAT CTCGGGGTAT ATAGGTGGCA GAGTTGACAA TATAATGATG
AGAATAGTTG ATATCCTGTA CAGTATACCT TTGATGATTT ACGTAATTCT TCTTTCAGTA
TCGTTAAAGC CTGCTTTGGA AGCTCTTTTT GATAAGTATT CATTTTTGAG CGGACTTCAG
ACAGTGGGTG CACCACTTGT TTGTATATAC ATTGCATTGG GACTTACTTA CTGGATTTCG
ATGGCGAGGA TTGTGCGTGG AGAGATATTA AGCTTAAAAC AGCAAGAATA TGTTACAGCC
GCAAAAACAA TTGGTGCAAG TGGTTGGAGG ATTTTGCTCA GGCACCTGAT TCCAAACAGC
ATGGGGTCAA TTATAGTCAC TGCTACGCTG CAGATTCCAA GTGCCATTTT TACTGAGTCT
TTTTTGAGCT TCATTGGTCT TGGTGTTGAT GCACCTGTTC CATCACTTGG TTCTTTGGCA
TCAGATGGTG TTAACGGTTT TATATCATAC CCTTATAGGC TATTTTTCCC ATCGCTTTTG
TTGTGTTTGA TAATACTTGC ATTCAACTTG TTTGGGGATG GGCTCAGAGA TGCACTTGAT
CCAAGAATGA GAAAGTAA
 
Protein sequence
MESVSKELFV PVPKEERQQE TIVRPSMSYW QDAWRRLKAN KVAMASMWTI VFFILLAIIG 
PIVMPYKYDQ QIRGHEALPP SLTHLFGTDE LGRDLFVRCL YGMRISLSIG IVATIINIVI
GVLYGGISGY IGGRVDNIMM RIVDILYSIP LMIYVILLSV SLKPALEALF DKYSFLSGLQ
TVGAPLVCIY IALGLTYWIS MARIVRGEIL SLKQQEYVTA AKTIGASGWR ILLRHLIPNS
MGSIIVTATL QIPSAIFTES FLSFIGLGVD APVPSLGSLA SDGVNGFISY PYRLFFPSLL
LCLIILAFNL FGDGLRDALD PRMRK