Gene Information |
Locus tag | GWCH70_1476 |
Symbol | |
ID | 7976922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1550942 |
End bp | 1551802 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644798380 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002949553 |
Protein GI | 239826929 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
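The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are inclusive 1-based coordinates and that the reported protein length excludes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" rows of this record.
length = gene_length(1550942, 1551802)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the record: 861 bp and 286 aa.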
| |
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AAATAGTTTG GAAAATGTGG ATCACGCTTG CGCTTATCGC ACTATTAAGC ATCACTGCTT TAGCCGGATG CAGCAGCGAA TCGTCAACAT CCAACAAAGG GGATGGGGCA AAATCAACAG AAACGTCGGG CGGAACGAAC ACGTTAGAAA AAATTAAAAA ACGCGGCAAA CTCGTTGTTG GAGTAAAGTA TGACTTGAAC TTGTTCGGTT TAAAAAATCC AGAAACCGGA AAAGTAGAAG GGTTTGATAT TGACATTGCC AAAGGATTAG CGAAAAAAAT TCTTGGTGAT GAAAATAAAA TCGAACTAAA AGAAGTGACA TCCAAAACAC GCATTCCAAT GCTCAATAAC GGAGAAATCG ATGCGATTAT CGCGACAATG ACCATCACAG AAGAACGGAA AAAAGAAGTT GATTTCTCTG ATGTGTATTT CATGGCCGGA CAATCGTTAC TTGTCAAAAA AGACAGCAAA ATTAACAGCG TAAAAGATTT GAAAAAAGGA ATGACCGTGT TAACGGCAAA AGGTTCTACA TCCGCGCAAA ACATCCGCAA AGTAGCGCCA GAAGTCAATG TATTAGAATT TGAAAACTAT GCTGAAGCGT TTACAGCGCT AAAAGCTGGG CAAGGCGATG CGCTCACAAC GGACAATGCT TTGCTTTTGG GAATGGCAAA ACAAGATCCA AACTACCGCG TTCTTGACGA AACGTTTACC GAAGAACCAT ACGGCATCGC CGTCCGCAAA GGAGACAAAG AATTTTTGCA AGTCATTAAC GAATACTTAA AAGAAATTAA AGAAAACGGC GAATACGACA AAATTTATGA AAAATGGATT GGGAAAAAAC CGCAACAATA A
|
Protein sequence | MKRKIVWKMW ITLALIALLS ITALAGCSSE SSTSNKGDGA KSTETSGGTN TLEKIKKRGK LVVGVKYDLN LFGLKNPETG KVEGFDIDIA KGLAKKILGD ENKIELKEVT SKTRIPMLNN GEIDAIIATM TITEERKKEV DFSDVYFMAG QSLLVKKDSK INSVKDLKKG MTVLTAKGST SAQNIRKVAP EVNVLEFENY AEAFTALKAG QGDALTTDNA LLLGMAKQDP NYRVLDETFT EEPYGIAVRK GDKEFLQVIN EYLKEIKENG EYDKIYEKWI GKKPQQ
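The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the record lists the minus strand), and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
head = "ATGAAAAGAA AAATAGTTTG GAAAATGTGG"
print(translate(head))  # MKRKIVWKMW
print(round(gc_fraction(head), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would let the result be compared with the protein sequence and GC content fields of the record.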