Gene Athe_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2052 
Symbol 
ID7408265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2165718 
End bp2167415 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content36% 
IMG OID643716419 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002573902 
Protein GI222530020 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAGTT CAAAAAGGTT ACTTTCGATT TTATCAATTG TAGTGGTTAT AAGTTTTATA 
TTAGGAATTG GCATTATTGG AAATGCTGGA AGTTCAAAGC TTGTGAAGCC ACTCAAACCA
ACACCCGAAG CAAAAAAGCC AATTACTCTC ACTATGTACA GTGCTGAAAC AAACCCAAAT
GACGATGGAT TTAAGTCACC AGTTGCACAA AAGATAAAAG AACTTACAGG TGTTACATTA
AAGATTGAGT ATGCCATAGC TCAAGGTGCT GGTCAACAAA AAATTCAGCT AATGGCTGCA
AGTGGTGATT ATCCAGACCT TGTGTATGCA AAAGGAGACT TACAACTTCT TAAAAATGCT
GGTGGTATTG TACAGTTAGA TAGTTTAATA GAAAAGTATG GTCCCAACAT TAAAAAAGCA
TATGGAAAAA ATCTCAAGAG GCTTAGATGG AGTCCTCAGG ATCCGCATAT ATACTGTTTG
GGGATAACAA CAGATAATGA TGCAACACTT GATGTAAATG GTGGATTTAT GGTTCAGCAC
AGAGTAGTAA TAGAACAAAA TTATCCCAAG ATTAGAACAA TAAAAGATTT TGAAAATGTA
ATAGTAAATT ACTGGAAAAA ACATCCTACA ACAGACGGAC TTCCAACCAT TCCTTTGACA
CTTAGTGCTG ACGATTGGCG AACGGTTATT TCTGTTACAA ATCCAGCTTT TCAGGCAACA
GGTGCACCTG ACGATGGAGA GTTTTATGTT GACCCAAAAA CTTTGAAAGT GATAAGACAT
TATAAACGTC CAATTGAAAA AGAATATTTC AAATGGTTAA ATCACTTATG GAACGCAGGA
ATTCTTGATA GAGAGACATT TGTCCAGAAA GACGACCAGT ATAAAGCTAA AATAGCATCT
GGAAGAGTTC TTGCTTTAAT TGACGCAGGT TGGGCAGTGG GTGAACCAAT CACTGCTCTC
AAAAAAGCAG GCAAATACGA ATACACATAC GGCTATTATC CTGTTACAGT TAATGAAAAG
ATAAAACAAT GTCCGCCTGA TGTAAAAGTT GGATACACAG GTGGCTGGGG TGTTGCTATA
ACAGTAAAGT GCAAAGATAA GGTGAGAGCA ATTAAGTTCC TTGACTGGAT GTGCACCGAA
GATGCTAATA TCTTAAGACA ATGGGGTATT GAAGGTGTTC ATCACACATA TATAAATGGT
AAGAGAGTAT TTACACCAAA ATATGACCAG ATGAGAAAAA CTGATCCTAC ATTTGGAAAA
AAGACTGGGA TAGGTCCTTA CATTTACCCG TTCCCGAGAC TGCCTAATAC GTATATTGAT
TCAACAGGAA ATCCAATTGC ACCTGACACG AGAAAAGAAG ATATAAGAAA GAACTATAGC
GATGTTGAGA AGAAAGTATT GTCTGCATAT AAAGCAGAGA TTTGGAAAGA CTTATTCCCA
AAATCAAATG AGTATCCAGA AAAAACATGG GGTTATCTCT GGATGATTTC AATTGATGAT
CCTAATATCA AAACAATTAA CGATAAAATC TGGAATTATA CACTTTCGAC CATTCCAAAA
GTTGTAATGG CAAAAGAAAA AGACTTTGAT AAGGTATGGA ACGAATTTTT GGATGGTTTT
GAGAAGCTTG GAAACAGCAA GGTTGAAGAA TATTATACAA AAAGAATCAA GCAAAACATT
GAATTGTGGA CAAAATAA
 
Protein sequence
MRSSKRLLSI LSIVVVISFI LGIGIIGNAG SSKLVKPLKP TPEAKKPITL TMYSAETNPN 
DDGFKSPVAQ KIKELTGVTL KIEYAIAQGA GQQKIQLMAA SGDYPDLVYA KGDLQLLKNA
GGIVQLDSLI EKYGPNIKKA YGKNLKRLRW SPQDPHIYCL GITTDNDATL DVNGGFMVQH
RVVIEQNYPK IRTIKDFENV IVNYWKKHPT TDGLPTIPLT LSADDWRTVI SVTNPAFQAT
GAPDDGEFYV DPKTLKVIRH YKRPIEKEYF KWLNHLWNAG ILDRETFVQK DDQYKAKIAS
GRVLALIDAG WAVGEPITAL KKAGKYEYTY GYYPVTVNEK IKQCPPDVKV GYTGGWGVAI
TVKCKDKVRA IKFLDWMCTE DANILRQWGI EGVHHTYING KRVFTPKYDQ MRKTDPTFGK
KTGIGPYIYP FPRLPNTYID STGNPIAPDT RKEDIRKNYS DVEKKVLSAY KAEIWKDLFP
KSNEYPEKTW GYLWMISIDD PNIKTINDKI WNYTLSTIPK VVMAKEKDFD KVWNEFLDGF
EKLGNSKVEE YYTKRIKQNI ELWTK