Gene Athe_2753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2753 
Symbol 
ID7408323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2902907 
End bp2904052 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content34% 
IMG OID643717109 
Productphosphate binding protein 
Protein accessionYP_002574578 
Protein GI222530696 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TATTCTTTAA AAAGGTTATT ACTCTAATAG CACTTTATTG CTTTATGTCA 
ACAATATGTT TAATCCCATT AATGTCCTAC TCACAAAGTC TAACAGTAAA GTCAAAATCA
CTTCCCTCAT CAACAACACA AAAACAGATT ATTATTACAT TTTCACAGGA TATCTTAAAA
GGTCCAAATT TTGATAAGAT AACACTTTTA AAAAACAAAA AATCAAAGGT ACAATTCTCA
GCTCAGGTTT CATATAACAA GCTTGTCATA ACAATAAAAG AAAATCTTAC TCCAAAAGCT
CAGTATCTAT TGACAATTCC AAAAAATGCT CTAAAGTCAG CTAAAGGAGA TTATAACCCA
GAACTTAAGT ACACATTTAT TCCACAAACT TTTTCAACAA ACCTTTCTGG AAGAATTATG
ATTGCAGGGT CAACATCTGT TCAGCCACTT GCTGATGAAC TTGCAAAATA TTTTATGCAG
CAATATCCAA AAGTATCAAT TGAGGTTCAA GGTGGAGGCT CATCAGTGGG AATAAAATCT
GCTATTCAAG GAATTGTAGA CATTGGAACA TCATCAAGAG AACTGACAGA GGATGAATCA
AAACAGCTAT CAGCAAAAGG CTGGCAAGAG ATAAAAATTG CAGAAGATGG CATTGCAGTT
ATTGTTCACA AATCCAATCC TGTGTCAAAC CTCTCAATTG AACAAATTAG AGACATATTC
TCTGGCAAGA TTAAAAACTG GAAAGAGGTT GGCGGTAAAG ACGCTAAAAT AGTTGTTGTC
ACAAGAGAAG AAGGTTCTGG TACAAGAGGC GCGTTTGAAG AAATAGTTAT GGGAAAATCA
TCAAAGATAA CAGACTCAGC AATTGTCCAG CCATCAACTG GTGCTGTAAA AACAACAGTT
TCACAGGATG AAAATGCAAT TGGATATATA TCAATTGGCG TATTAGATAG CACAGTAAAA
GGTGTCAAGG TTGAAGGTGT TGAACCATCA GAAAAGAACG TAAAGCTCGG AAAATACAAA
ATTAAAAGAC CATTTCTCTT CTTAGTTTCC AAAAATCCAA GCAAGGTAAC AAAAGCATTT
GTTGATTTTG TCCTCTCTGA TGAAGGTCAG GCAATTGTAG CTAAAAACTA TATCTCAGTT
AAGTAA
 
Protein sequence
MKKVFFKKVI TLIALYCFMS TICLIPLMSY SQSLTVKSKS LPSSTTQKQI IITFSQDILK 
GPNFDKITLL KNKKSKVQFS AQVSYNKLVI TIKENLTPKA QYLLTIPKNA LKSAKGDYNP
ELKYTFIPQT FSTNLSGRIM IAGSTSVQPL ADELAKYFMQ QYPKVSIEVQ GGGSSVGIKS
AIQGIVDIGT SSRELTEDES KQLSAKGWQE IKIAEDGIAV IVHKSNPVSN LSIEQIRDIF
SGKIKNWKEV GGKDAKIVVV TREEGSGTRG AFEEIVMGKS SKITDSAIVQ PSTGAVKTTV
SQDENAIGYI SIGVLDSTVK GVKVEGVEPS EKNVKLGKYK IKRPFLFLVS KNPSKVTKAF
VDFVLSDEGQ AIVAKNYISV K