Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1273 |
Symbol | |
ID | 7976054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1322427 |
End bp | 1324208 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644798218 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002949391 |
Protein GI | 239826767 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAAAAA AATGGCAGTG GAAGTTTTTT GTGGTGCTCG TTGTCCTTTT ATTAGGATTA ACGGCGTGCA GCAACAACTC CACAAACAAC AAAGACAGCA AAAATCAAAC GGCGCAAAAG GAAGACATCA GCAAATTCCC AATGACCGTC AAAAATGACG GGAAAATCGT AGATGGTGTA TTAAAATACG GATTAGTATC CGATACACCA TTTGAAGGAA CGCTTAGCTA TGCGTTTTAC AAAGGGCAAC CGGATGCAGA AATTTTGCAG TTTTTCGATG AATCACTTTT CCGTACAAAC GGAGACTATG AAATTACGAA CGATGGCGCG GCGACATACG AACTTTCCGA TGATAAAAAA ACGATAACGA TCAAAATTAA AGATAATGTC AAATGGCATG ACGGTGAGCC TGTGAAAGCG GAAGATTTAG AATACGCTTA CTTAGTCATC GGCCATAAAG ATTATACGGG TGTCCGCTAC GGAGATGCGC TCATTCAAGA TATTGTCGGC ATGGAGGAAT ACCATAGTGG AAAAGCGGAC AAAATCTCCG GAATTAAAGT CATTGATGAC AAAACACTAA CCATTACATG GAAACATGCC AATCCATCTG TGCTGACAGG CATTTGGGCC TATCCGCTCC CAAAACATTA TTTAAAAGAT GTTCCGATTA AAGATTTGGC GAAATCGGAT AAAATCCGCA AAAACCCAAT TGGTTTTGGT CCATTTAAAG TGAAAAAGAT CGTTCCAGGC GAATCGGTGG AATTTGTTCG CAACGATGAT TATTGGGATG GAAAGCCGAA CTTAAAAGGG GTTATTTTAA AAGTTGTCAG CCCGCAAGTC GTATTGCAAG CTCTTAAAAA AGGCGAGATT GACATTGCGG AATTCCCGAC AGATCAATAT GTCAGCGCAA AAGGGACGAA AAACATTCAA TTTGTCGGCA AAATTGATTT AGCTTACAAC TACATCGGGT TTAAACTTGG CCACTGGGAT GCGAAAAAAC AAGAAAATGT CATGGATAAT CCAAAATTCC AAAACAAAAA GCTTCGCCAA GCGATGGCAT ACGCGATTAA TAACAAGGAA GTGGCTGACA GACTTTATCA CGGGTTGCGC TTCCCAGCGA CAACATTAAT TCCGCCATCC TTCCCAGGAT ACCATGATAA AAACGCAAAA GGATATACGT ATGATCCAGA AAAGGCGAAA AAATTGCTGG ATGAAGCCGG ATATAAAGAT GTCGACGGCG ACGGCTTCCG TGAAGATCCA AACGGCAAGA AGTTTACGAT TAACTTCCTG TCAATGAGCG GCGGGGATAT TGCCGAACCG CTGGCGAAAT TTTACATGCA ATGCTGGAAA GACGTCGGTT TGAATGTGCA ACTGGTTGAT GGCCGCTTGG CGGAATTCAA CTCGTTCCAT GACATGGTTG AAAAAGATAA CCCGAAAGTC GATGTGTTCG CCGCGGCGTG GTACACTGGA ACCAATGTGG ACCCGTATGG ATTATATGGA CGCGATGTGA TGTTTAACTA TTCTCGATGG GTGAACGAGA AAAATGATGA ATTGTTAGAA AAAGGACATT CCGAGCAAGC ATTCGATAAA GAATACCGCA AGAAAATTTA CAACGAATGG CAAGCGCTGA TGAATGAAGA AGTACCTGTC ATCCCGACTC TGTACCGCTC GATTATTTAT GCGGTAAACA ATCGTGTGAA AAACTTTACC GTCGACCCTA GCTCAAAATT AACATGGAAA GATGTTGGTG TCACTTCCGA AAAACCAGAA GTAGCGCAAT AA
|
Protein sequence | MRKKWQWKFF VVLVVLLLGL TACSNNSTNN KDSKNQTAQK EDISKFPMTV KNDGKIVDGV LKYGLVSDTP FEGTLSYAFY KGQPDAEILQ FFDESLFRTN GDYEITNDGA ATYELSDDKK TITIKIKDNV KWHDGEPVKA EDLEYAYLVI GHKDYTGVRY GDALIQDIVG MEEYHSGKAD KISGIKVIDD KTLTITWKHA NPSVLTGIWA YPLPKHYLKD VPIKDLAKSD KIRKNPIGFG PFKVKKIVPG ESVEFVRNDD YWDGKPNLKG VILKVVSPQV VLQALKKGEI DIAEFPTDQY VSAKGTKNIQ FVGKIDLAYN YIGFKLGHWD AKKQENVMDN PKFQNKKLRQ AMAYAINNKE VADRLYHGLR FPATTLIPPS FPGYHDKNAK GYTYDPEKAK KLLDEAGYKD VDGDGFREDP NGKKFTINFL SMSGGDIAEP LAKFYMQCWK DVGLNVQLVD GRLAEFNSFH DMVEKDNPKV DVFAAAWYTG TNVDPYGLYG RDVMFNYSRW VNEKNDELLE KGHSEQAFDK EYRKKIYNEW QALMNEEVPV IPTLYRSIIY AVNNRVKNFT VDPSSKLTWK DVGVTSEKPE VAQ
|
| |