Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1913 |
Symbol | |
ID | 7407326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2017642 |
End bp | 2019249 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643716285 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002573774 |
Protein GI | 222529892 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.988615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC GTATTGTGGC TACTTTTATT TTAGTTGCGT TCCTTGTGAC AGGGTTATTT TTAGGTACAA ATTACAAAAA TGCAACAGCT GCTTCATCTA AACAGGTGCT TACTTATATT AATGGTGCTG AACCAAGATA TCTTGACCCA GCTTTAAATA CTGCGCTTGA TGCAGCAAAT ATTATTATTA ATGTTTTTGA AGGCTTGACA AGGGTTGATG TAAAAGGAAG AACTGTCCCA GGAATGGCAG AAAAATGGAC AGTATCAAAG GATGGACTTA CTTATACTTT CTATATAAGA AAGAATGCAA AGTGGTCAGA CGGTAAACCT GTTACAGCAT ACGATTTTGA GTATGCTTGG AAAAGAGCAT TAGATCCGAA AACAGGTTCA GAATATGCTT ATCAGCTTTT CTACATCAAA AATGGTCAAA AATTCTATGA AGGTAAAGCA AAAGCATCTG ATGTTGGTGT CAAAGCTTTA AATGCTACAA CTTTACGGGT TACATTGGAA GCACCAACAC CATACTTTAT TGATCTTACA AACTTCCCAA CATACTTCCC AGTAAGAAAA GATATTGTTG AGAAGTATGG TGATAAATGG CAAACAGATC CAAAGACATA TATTGGAAAT GGTCCATTTA AAATGACAAA ATGGGTTCAT AATTCTTATA TTGAACTTAC TAAGAACACC AATTACTGGG ATGCAAAGTC AATTACTTTA CAAAAGATGG TGCTTAAATT ATCATCTGAT AACAATGCTA ATTTGATGGC TTTTACTGCA GGGCAAGTTG ATGGTGCCGA AGGTATACCA ACCGAAGAAA TTCCAAGACT TAAAAAAGAA GGAAAACTTA AAATAGCACC TTTATTAGGA ACATACTACT ATGATGTAAA TTGCAAGAAA GCACCTTTTA ATGATAAGAG AGTAAGAGAG GCTTTATCAC TTGCTATTGA TAGGACACGT ATTGTAGCTC TTCTAAAAGG TGAACAAAAG CCAGCAACAG GTTTTGTTCC ATATGGTGTT AAAGGAATTT CTAAAGATTT CAGAAGTGAA GCAGGTAATT ACTTACCAGT GAATGCTGAT TTAGCAAAAG CAAAGAAACT GTTAGCTGAA GCAGGGTATC CAAACGGAAA GAATTTCCCA GATATTGAGA TTATTTATAA TACTGATGAA GGACATAAAA AAGTTGCCGA AGCTATTCAA AATATGTGGA AACAACTTGG AATAAATGTT AAACTTTCTA ATATGGAATG GAAAGTATTG CTTGAAAGAA GACAGAAAAA AGACTATATA GTAGCAAGAG ATGGATGGGT TGGCGATTAT AACGATCCGA TGACTTTCTT AGATTTGTTT ACCTCATATA GTGGCAATAA TAACACAAAT TGGAGCAATA AGCAATATGA TTCTCTAATA GATAAGGCTA AGAAGACCAT TGATGCTAAA CAAAGAATGC AGTATATGAT ACAAGCAGAA AAAATATTGA TGCAAGACCA TGCAATAATT CCAATCTATT TCTATACAAA AGTTTATCTT CTGAGAGACT ATGTAAAGAA TTACTATATT TCTCCACTTG GATTTAACTA CTTCATGTAT GCCAAGATTG TAAAGTAA
|
Protein sequence | MKKRIVATFI LVAFLVTGLF LGTNYKNATA ASSKQVLTYI NGAEPRYLDP ALNTALDAAN IIINVFEGLT RVDVKGRTVP GMAEKWTVSK DGLTYTFYIR KNAKWSDGKP VTAYDFEYAW KRALDPKTGS EYAYQLFYIK NGQKFYEGKA KASDVGVKAL NATTLRVTLE APTPYFIDLT NFPTYFPVRK DIVEKYGDKW QTDPKTYIGN GPFKMTKWVH NSYIELTKNT NYWDAKSITL QKMVLKLSSD NNANLMAFTA GQVDGAEGIP TEEIPRLKKE GKLKIAPLLG TYYYDVNCKK APFNDKRVRE ALSLAIDRTR IVALLKGEQK PATGFVPYGV KGISKDFRSE AGNYLPVNAD LAKAKKLLAE AGYPNGKNFP DIEIIYNTDE GHKKVAEAIQ NMWKQLGINV KLSNMEWKVL LERRQKKDYI VARDGWVGDY NDPMTFLDLF TSYSGNNNTN WSNKQYDSLI DKAKKTIDAK QRMQYMIQAE KILMQDHAII PIYFYTKVYL LRDYVKNYYI SPLGFNYFMY AKIVK
|
| |