Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2782 |
Symbol | |
ID | 7978004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2819373 |
End bp | 2820371 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644799577 |
Product | ABC transporter substrate-binding protein |
Protein accession | YP_002950736 |
Protein GI | 239828112 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA TTGCTATGAT ATCGCTTTTT CTCTTCCTTC TTATTATTCC GCTGGCTGCT TGCAACAAGC AAACGAAGGA GGAAATAAAG ACAGTACGTG TTGCGGAAGT AACTCGTTCG ATTTTCTATG CCCCGCAATA TGTTGCCCTA TCGAAAGGAT TTTTTAAAAA CGAAGGATTA AAGGTCGAAC TCACCACAAC ATGGGGCGGC GATAAAACGA TGACAACTCT TCTTTCGGGA GGAGCCGATA TCGCCCTCGT CGGTTCAGAA ACATCCATTT ATGTCTACAG CCAAGGAACA AATGATCCAG TCATTAACTT CGCGCAGTTG ACGCAAACGG ACGGAACTTT TCTTGTCTCC CGCAATAAAA TTGAAAACTT TACTTGGGAC CAACTTAAAG GTAGCACATT TCTTGGCCAG CGCAAAGGCG GTATGCCGCA AATGGTCGGA GAGTTTGTCT TGAAAAAGCA TGGCATCGAT CCACATAAAG ATTTAAAGCT AATTCAAAAC GTCGATTTTG CTAATATCGC GAACGCATTT GCCAGCGGTA CAGGTGATTT CGTCCAACTA TTTGAACCAA CAGCAAGCAT TTTCGAACAG GAAGGCAAAG GGTATATTGT CGCATCATTT GGCACGGAAT CCGGCCACGT CCCGTATACG ACTTTTATGG CAAAACAAAG CTTTATTAAA AACAACAAAG ATACGATCGA AAAATTCACA CGCGCACTTT ATAAAGCCGA GCAATGGGTG GAAACACATA GCGCAGCAGA AGTGGCAAAA GCAATTCAGC CTTACTTTAA AGATACGGAT ATTCAAATTA TCGAAAAAGT CGTCGATCGT TATAAAAGCC AAGGGACATA TGCGACAAAT CCGGTTCTTG ACAAAGAAGA ATGGAATAAT TTGCAAAACA TTATGGACGA AGCGGGTGAA TTGCCAAAAC GCATCGATCA TGAAACACTT GTCGATAATT CATTCGCGGA AAAAGTGATG TCCAAATAA
|
Protein sequence | MKKIAMISLF LFLLIIPLAA CNKQTKEEIK TVRVAEVTRS IFYAPQYVAL SKGFFKNEGL KVELTTTWGG DKTMTTLLSG GADIALVGSE TSIYVYSQGT NDPVINFAQL TQTDGTFLVS RNKIENFTWD QLKGSTFLGQ RKGGMPQMVG EFVLKKHGID PHKDLKLIQN VDFANIANAF ASGTGDFVQL FEPTASIFEQ EGKGYIVASF GTESGHVPYT TFMAKQSFIK NNKDTIEKFT RALYKAEQWV ETHSAAEVAK AIQPYFKDTD IQIIEKVVDR YKSQGTYATN PVLDKEEWNN LQNIMDEAGE LPKRIDHETL VDNSFAEKVM SK
|
| |