Gene Information |
Locus tag | GWCH70_0845 |
Symbol | |
ID | 7979333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 908978 |
End bp | 910123 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797818 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_002948991 |
Protein GI | 239826367 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
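The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: the function name `check_gene_record` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon.

```python
def check_gene_record(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if the lengths agree with 1-based inclusive coordinates.

    Assumption: the protein length does not count the terminal stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons = gene_len_bp // 3             # total codons, stop codon included
    return (
        span == gene_len_bp
        and gene_len_bp % 3 == 0          # a CDS should be a whole number of codons
        and codons - 1 == protein_len_aa  # drop the stop codon
    )


# Values taken from the record above: 910123 - 908978 + 1 = 1146 bp,
# and 1146 / 3 - 1 = 381 aa.
print(check_gene_record(908978, 910123, 1146, 381))  # prints True
```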
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAT TTAGACTTAA CTATTTAGTC GCATTTCTTT TTTCCATTAT CGTCATCCTT AGCGGCTGCA TGAATGAACA AGACGCCGTC AGTTCGAAAC CGGCCAAACA AGAAAACAGC ACGGCCGAAG AAGCAAGCGA AACAACAGCA GCAACAACCG AACAAGCAGA TATCCCTGAA GAATTAAAAA AACCAATAAA AATTGCCGCC ATTATGCAAA TGTCGATCGG CACGTTTTCT TCTCAGTATA TCGCTGGCGT GAAAGAACAA GTGCAAAAAT TTGGCGGCGA AGTGCAAATT TACAATGCGG ATAACGACTT AACGAAAATG GCTTCCTATG TCGAAACAGC CATTACGCAA AACGTCGATG CGATTCTGTT AGATCACGGT CGCGCTGATG CGTTGGAAGG CCCGGTGAAA AAGGCAGTCG AAAAAGGCAT TCCTGTTGTC GCATTCGATA ATGATTTAAA CATCCCAGGC GTAACAGTGA TTGACCAAGA TGACTACAGC CTTGCATGGA AAACGTTAAA AACGTTGGCC GAAGACTTAA ACGGCGAAGG AAACATCGTC ACCATTTGGG TTGGCGGATT TACACCGATG GAACGGCGCC ATGTGATTTA CGATGCGTTT AAAAAACGCT ATCCGAACAT CAAAGAAGTG GCGAAATTCG GTACGGCCAG CGCCAATACC GCTTTAGACA CACAAACGCA AATGGAAGCG ATTTTGAAAA AATATCCGAA TAAAGGCGAT ATTGACGCAG TGTTTGCGAC TTGGGATGAA TTTGCCAAAG GCGCAACACG TGCGATCGAG CAAGCAGGAC GCAATGAAAT TAAAGTATAT GGCATCGATT TAAGCGATGA AGATTTGCAA ATGATGCAAA AGCCAAATAG CCCTTGGGTC GCTACAACGG CAACGGATCC AGCCGAAGTC GGTCGCGTTC AAGTTCGCTT TGCGTATCAA AAAATTGCTG GCGAGAAAAC ACCAAACATT TATTCACTAG AACCGCATTT AGTCAAACGA TCCGATTTGC CTAACCAACA AGTTTCGATG AATGAGCTTC ACCAATACAT TTCCGGCTGG GGGCAATCAA ATGTTGCCAT TTCCCCTCGG ATGAAAACAT TGGAAGCACA GGTGAAAAAC AAATGA
|
Protein sequence | MKQFRLNYLV AFLFSIIVIL SGCMNEQDAV SSKPAKQENS TAEEASETTA ATTEQADIPE ELKKPIKIAA IMQMSIGTFS SQYIAGVKEQ VQKFGGEVQI YNADNDLTKM ASYVETAITQ NVDAILLDHG RADALEGPVK KAVEKGIPVV AFDNDLNIPG VTVIDQDDYS LAWKTLKTLA EDLNGEGNIV TIWVGGFTPM ERRHVIYDAF KKRYPNIKEV AKFGTASANT ALDTQTQMEA ILKKYPNKGD IDAVFATWDE FAKGATRAIE QAGRNEIKVY GIDLSDEDLQ MMQKPNSPWV ATTATDPAEV GRVQVRFAYQ KIAGEKTPNI YSLEPHLVKR SDLPNQQVSM NELHQYISGW GQSNVAISPR MKTLEAQVKN K
|
| |
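The gene sequence is listed in coding orientation (it begins with ATG and ends with the stop codon TGA), so it can be translated directly even though the gene lies on the minus strand. The following is a minimal sketch, not the pipeline used to produce this record: the helper names `clean`, `translate` and `gc_percent` are hypothetical. It uses the codon-to-amino-acid assignments of translation table 11, which are the same as the standard code, and it does not handle the alternative start codons that table 11 allows, since this gene starts with ATG.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between the 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAAGCAAT TTAGACTTAA CTATTTAGTC"))  # prints MKQFRLNYLV
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once the spaces between blocks are removed, and `gc_percent` on the same input can be compared with the GC content field.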