Gene Information |
Locus tag | Athe_1942 |
Symbol | |
ID | 7407356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2049187 |
End bp | 2049987 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716314 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002573802 |
Protein GI | 222529920 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000237139 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTATAAAA AGGTTATAGC TTTGGTTTTG CTAATAGCAC TTTTTATCCC GTTTTTAAGC GGATGTACTT CAAATGACCA AAACATGACA ACCTTAGAAA AGATAAAAAA GACAAAAGAG TTTGTTGTTG GTATGGACAA CACATTTCCA CCAATGGAAT TTACTGATGA TAACAACAAC ACAGTGGGAT TTGATGTGGA CTTAGCAAAC GAAATAGCAA AAAAGCTTGG TGCAAAGCTA AAGATTGTTG CTGTTGACTG GAGTGGAATC CAGAGCGCTT TAAAGTCCAA AAAGTTTGAT GCTATTATTT CATGCTTTAG TATAACAGAT GAGAGAAAGA AAGCTTTCAA TTTAGCGGGG CCATATCTTT ACATTCGTCA GGTTATTGCT GTGAAAAAGG GCGACAGCTC AATCAAAAGT TTTGAAGATT TAAAAGGGAT TAAGATAGGC GTTCAAGCAA ACACAACAGG TGACAATGCT GTTCAAAAGA TGAAATTTAT AAACTATGAA AAGGATGTCA CACGATACGA AAGAATAACT GATGCTTTCA ACGACCTTGA CATTGGGAGA ATAAAAGCAG TTGTGATAGA CAGTGTTGTT GCTTACTATT ATAAAAAACA AAATCCTGAA AAGTTTGACA TAGCACCTGC ACAGCTTGAA AGAGAACCTG TGGGAATAGC TCTCAGAAAA GAGGACAAGG ACCTGTACAA CGAAATTCAA AAGATTTTAG ACCAGTTAAA AAAGGACGGG ACTATTGCGA AAATATCTGA AAAATGGTTT GGAGAAGACA TTACAAAGTA A
Protein sequence | MYKKVIALVL LIALFIPFLS GCTSNDQNMT TLEKIKKTKE FVVGMDNTFP PMEFTDDNNN TVGFDVDLAN EIAKKLGAKL KIVAVDWSGI QSALKSKKFD AIISCFSITD ERKKAFNLAG PYLYIRQVIA VKKGDSSIKS FEDLKGIKIG VQANTTGDNA VQKMKFINYE KDVTRYERIT DAFNDLDIGR IKAVVIDSVV AYYYKKQNPE KFDIAPAQLE REPVGIALRK EDKDLYNEIQ KILDQLKKDG TIAKISEKWF GEDITK
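The length and composition fields in this record can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only and is not part of the source database: the function names are hypothetical, and it assumes inclusive 1-based coordinates, an unspliced CDS (the record lists "Is gene spliced | No"), and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Start bp and End bp as listed in the record above.
    n = gene_length(2049187, 2049987)
    print(n, protein_length(n))
    # Paste the full "Gene sequence" field here to compare with the listed GC content.
    print(gc_percent("ATGTATAAAA AGGTTATAGC"))
```

With the listed coordinates, the first two functions give 801 bp and 266 aa, matching the "Gene Length" and "Protein Length" fields.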