Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2622 |
Symbol | |
ID | 7978285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2657208 |
End bp | 2658233 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644799423 |
Product | aliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein |
Protein accession | YP_002950582 |
Protein GI | 239827958 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000404129 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAC ATATATTGTT TTTGCTTATA AGCTTGTTTA TCGCATTGAT GTCTGGCTGC GGACAGGAAG CAACGACAAG TACAACAGCG AAAGGAAAAG AAAAAAACAT AACGATTCGC ATCGGCATTC AGCAAAGCCT TGGACCGCTT TTACTGGCAA AAGAAAAAGG ATGGTTTGAA AAAGAATTTG CCAAAGAGGG AGTCAACGTC AAATGGATTG AGTTTCAAAG CGGGCCGCCG CATTTTGAAG CGATGGCATC TAACAATCTT GATTTCGGCG CTGTTGGAAA CTCTCCGGTA ATTTCAGCCC AAGCCGCTAA CATTCAATTT AAGGAGATTA GTAAAGCAGC AGAAGGATTA AAAGGAGATG CCATCATTGT GCCGAAAGAA AGCAAAATTC GTAGTTTAAC AGATTTGAAA GGAAAAAAAA TCGCTGTTGC CAAAGGAAGC AGTGGATTCA ACTTCTTATA TAAAGCCCTC GAGCATGCCG GCTTGAAAGC GTCAGATGTT GAAATGATTC AATTGCAGCC GGATGAAGCG CAGGCAGCGT TTGATACACA TAAAGTGGAT GCTTGGGCGA TTTGGGAGCC GTTTATTTCC TACGAGGTGA TCAAAAATAA AGCACGTATC GTAGCGGATG GAGAGGATCT TCATGCATAT TCGCCATCGT TTATCGTGGC ACGGACGGGA TTTATCAAAG AGAATCCGGA TTTAACGGTT CAATTTTTGA AAATTTATGA AAAAGCTCGA CGTTGGCAAA ATGATCATTT TGATGAAGCG GTGGAAATTT ATGCGAAAGC GAAAAAGCTA GATAAAGATG TCATAGTGCG GGCGTTACGC AACAACCCAT CATTAAACGA GCCAATTACG GATGATGTTG TTCAAGCACA GCAAAAAACC GCCGATTTTC AATATGCTCA ACATATCATT AAAACCAAAA TTGATACAAG CAAAGTGGTC GAAAATCGAT ATATTAAAAA AGCATTACAA GAATTAGAGA AAGAAGGTGA GAACAAACAT GAATAA
|
Protein sequence | MRKHILFLLI SLFIALMSGC GQEATTSTTA KGKEKNITIR IGIQQSLGPL LLAKEKGWFE KEFAKEGVNV KWIEFQSGPP HFEAMASNNL DFGAVGNSPV ISAQAANIQF KEISKAAEGL KGDAIIVPKE SKIRSLTDLK GKKIAVAKGS SGFNFLYKAL EHAGLKASDV EMIQLQPDEA QAAFDTHKVD AWAIWEPFIS YEVIKNKARI VADGEDLHAY SPSFIVARTG FIKENPDLTV QFLKIYEKAR RWQNDHFDEA VEIYAKAKKL DKDVIVRALR NNPSLNEPIT DDVVQAQQKT ADFQYAQHII KTKIDTSKVV ENRYIKKALQ ELEKEGENKH E
|
| |