Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2052 |
Symbol | |
ID | 7408265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2165718 |
End bp | 2167415 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716419 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002573902 |
Protein GI | 222530020 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAGTT CAAAAAGGTT ACTTTCGATT TTATCAATTG TAGTGGTTAT AAGTTTTATA TTAGGAATTG GCATTATTGG AAATGCTGGA AGTTCAAAGC TTGTGAAGCC ACTCAAACCA ACACCCGAAG CAAAAAAGCC AATTACTCTC ACTATGTACA GTGCTGAAAC AAACCCAAAT GACGATGGAT TTAAGTCACC AGTTGCACAA AAGATAAAAG AACTTACAGG TGTTACATTA AAGATTGAGT ATGCCATAGC TCAAGGTGCT GGTCAACAAA AAATTCAGCT AATGGCTGCA AGTGGTGATT ATCCAGACCT TGTGTATGCA AAAGGAGACT TACAACTTCT TAAAAATGCT GGTGGTATTG TACAGTTAGA TAGTTTAATA GAAAAGTATG GTCCCAACAT TAAAAAAGCA TATGGAAAAA ATCTCAAGAG GCTTAGATGG AGTCCTCAGG ATCCGCATAT ATACTGTTTG GGGATAACAA CAGATAATGA TGCAACACTT GATGTAAATG GTGGATTTAT GGTTCAGCAC AGAGTAGTAA TAGAACAAAA TTATCCCAAG ATTAGAACAA TAAAAGATTT TGAAAATGTA ATAGTAAATT ACTGGAAAAA ACATCCTACA ACAGACGGAC TTCCAACCAT TCCTTTGACA CTTAGTGCTG ACGATTGGCG AACGGTTATT TCTGTTACAA ATCCAGCTTT TCAGGCAACA GGTGCACCTG ACGATGGAGA GTTTTATGTT GACCCAAAAA CTTTGAAAGT GATAAGACAT TATAAACGTC CAATTGAAAA AGAATATTTC AAATGGTTAA ATCACTTATG GAACGCAGGA ATTCTTGATA GAGAGACATT TGTCCAGAAA GACGACCAGT ATAAAGCTAA AATAGCATCT GGAAGAGTTC TTGCTTTAAT TGACGCAGGT TGGGCAGTGG GTGAACCAAT CACTGCTCTC AAAAAAGCAG GCAAATACGA ATACACATAC GGCTATTATC CTGTTACAGT TAATGAAAAG ATAAAACAAT GTCCGCCTGA TGTAAAAGTT GGATACACAG GTGGCTGGGG TGTTGCTATA ACAGTAAAGT GCAAAGATAA GGTGAGAGCA ATTAAGTTCC TTGACTGGAT GTGCACCGAA GATGCTAATA TCTTAAGACA ATGGGGTATT GAAGGTGTTC ATCACACATA TATAAATGGT AAGAGAGTAT TTACACCAAA ATATGACCAG ATGAGAAAAA CTGATCCTAC ATTTGGAAAA AAGACTGGGA TAGGTCCTTA CATTTACCCG TTCCCGAGAC TGCCTAATAC GTATATTGAT TCAACAGGAA ATCCAATTGC ACCTGACACG AGAAAAGAAG ATATAAGAAA GAACTATAGC GATGTTGAGA AGAAAGTATT GTCTGCATAT AAAGCAGAGA TTTGGAAAGA CTTATTCCCA AAATCAAATG AGTATCCAGA AAAAACATGG GGTTATCTCT GGATGATTTC AATTGATGAT CCTAATATCA AAACAATTAA CGATAAAATC TGGAATTATA CACTTTCGAC CATTCCAAAA GTTGTAATGG CAAAAGAAAA AGACTTTGAT AAGGTATGGA ACGAATTTTT GGATGGTTTT GAGAAGCTTG GAAACAGCAA GGTTGAAGAA TATTATACAA AAAGAATCAA GCAAAACATT GAATTGTGGA CAAAATAA
|
Protein sequence | MRSSKRLLSI LSIVVVISFI LGIGIIGNAG SSKLVKPLKP TPEAKKPITL TMYSAETNPN DDGFKSPVAQ KIKELTGVTL KIEYAIAQGA GQQKIQLMAA SGDYPDLVYA KGDLQLLKNA GGIVQLDSLI EKYGPNIKKA YGKNLKRLRW SPQDPHIYCL GITTDNDATL DVNGGFMVQH RVVIEQNYPK IRTIKDFENV IVNYWKKHPT TDGLPTIPLT LSADDWRTVI SVTNPAFQAT GAPDDGEFYV DPKTLKVIRH YKRPIEKEYF KWLNHLWNAG ILDRETFVQK DDQYKAKIAS GRVLALIDAG WAVGEPITAL KKAGKYEYTY GYYPVTVNEK IKQCPPDVKV GYTGGWGVAI TVKCKDKVRA IKFLDWMCTE DANILRQWGI EGVHHTYING KRVFTPKYDQ MRKTDPTFGK KTGIGPYIYP FPRLPNTYID STGNPIAPDT RKEDIRKNYS DVEKKVLSAY KAEIWKDLFP KSNEYPEKTW GYLWMISIDD PNIKTINDKI WNYTLSTIPK VVMAKEKDFD KVWNEFLDGF EKLGNSKVEE YYTKRIKQNI ELWTK
|
| |