Gene GWCH70_1273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1273 
Symbol 
ID7976054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1322427 
End bp1324208 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content42% 
IMG OID644798218 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002949391 
Protein GI239826767 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAAAAA AATGGCAGTG GAAGTTTTTT GTGGTGCTCG TTGTCCTTTT ATTAGGATTA 
ACGGCGTGCA GCAACAACTC CACAAACAAC AAAGACAGCA AAAATCAAAC GGCGCAAAAG
GAAGACATCA GCAAATTCCC AATGACCGTC AAAAATGACG GGAAAATCGT AGATGGTGTA
TTAAAATACG GATTAGTATC CGATACACCA TTTGAAGGAA CGCTTAGCTA TGCGTTTTAC
AAAGGGCAAC CGGATGCAGA AATTTTGCAG TTTTTCGATG AATCACTTTT CCGTACAAAC
GGAGACTATG AAATTACGAA CGATGGCGCG GCGACATACG AACTTTCCGA TGATAAAAAA
ACGATAACGA TCAAAATTAA AGATAATGTC AAATGGCATG ACGGTGAGCC TGTGAAAGCG
GAAGATTTAG AATACGCTTA CTTAGTCATC GGCCATAAAG ATTATACGGG TGTCCGCTAC
GGAGATGCGC TCATTCAAGA TATTGTCGGC ATGGAGGAAT ACCATAGTGG AAAAGCGGAC
AAAATCTCCG GAATTAAAGT CATTGATGAC AAAACACTAA CCATTACATG GAAACATGCC
AATCCATCTG TGCTGACAGG CATTTGGGCC TATCCGCTCC CAAAACATTA TTTAAAAGAT
GTTCCGATTA AAGATTTGGC GAAATCGGAT AAAATCCGCA AAAACCCAAT TGGTTTTGGT
CCATTTAAAG TGAAAAAGAT CGTTCCAGGC GAATCGGTGG AATTTGTTCG CAACGATGAT
TATTGGGATG GAAAGCCGAA CTTAAAAGGG GTTATTTTAA AAGTTGTCAG CCCGCAAGTC
GTATTGCAAG CTCTTAAAAA AGGCGAGATT GACATTGCGG AATTCCCGAC AGATCAATAT
GTCAGCGCAA AAGGGACGAA AAACATTCAA TTTGTCGGCA AAATTGATTT AGCTTACAAC
TACATCGGGT TTAAACTTGG CCACTGGGAT GCGAAAAAAC AAGAAAATGT CATGGATAAT
CCAAAATTCC AAAACAAAAA GCTTCGCCAA GCGATGGCAT ACGCGATTAA TAACAAGGAA
GTGGCTGACA GACTTTATCA CGGGTTGCGC TTCCCAGCGA CAACATTAAT TCCGCCATCC
TTCCCAGGAT ACCATGATAA AAACGCAAAA GGATATACGT ATGATCCAGA AAAGGCGAAA
AAATTGCTGG ATGAAGCCGG ATATAAAGAT GTCGACGGCG ACGGCTTCCG TGAAGATCCA
AACGGCAAGA AGTTTACGAT TAACTTCCTG TCAATGAGCG GCGGGGATAT TGCCGAACCG
CTGGCGAAAT TTTACATGCA ATGCTGGAAA GACGTCGGTT TGAATGTGCA ACTGGTTGAT
GGCCGCTTGG CGGAATTCAA CTCGTTCCAT GACATGGTTG AAAAAGATAA CCCGAAAGTC
GATGTGTTCG CCGCGGCGTG GTACACTGGA ACCAATGTGG ACCCGTATGG ATTATATGGA
CGCGATGTGA TGTTTAACTA TTCTCGATGG GTGAACGAGA AAAATGATGA ATTGTTAGAA
AAAGGACATT CCGAGCAAGC ATTCGATAAA GAATACCGCA AGAAAATTTA CAACGAATGG
CAAGCGCTGA TGAATGAAGA AGTACCTGTC ATCCCGACTC TGTACCGCTC GATTATTTAT
GCGGTAAACA ATCGTGTGAA AAACTTTACC GTCGACCCTA GCTCAAAATT AACATGGAAA
GATGTTGGTG TCACTTCCGA AAAACCAGAA GTAGCGCAAT AA
 
Protein sequence
MRKKWQWKFF VVLVVLLLGL TACSNNSTNN KDSKNQTAQK EDISKFPMTV KNDGKIVDGV 
LKYGLVSDTP FEGTLSYAFY KGQPDAEILQ FFDESLFRTN GDYEITNDGA ATYELSDDKK
TITIKIKDNV KWHDGEPVKA EDLEYAYLVI GHKDYTGVRY GDALIQDIVG MEEYHSGKAD
KISGIKVIDD KTLTITWKHA NPSVLTGIWA YPLPKHYLKD VPIKDLAKSD KIRKNPIGFG
PFKVKKIVPG ESVEFVRNDD YWDGKPNLKG VILKVVSPQV VLQALKKGEI DIAEFPTDQY
VSAKGTKNIQ FVGKIDLAYN YIGFKLGHWD AKKQENVMDN PKFQNKKLRQ AMAYAINNKE
VADRLYHGLR FPATTLIPPS FPGYHDKNAK GYTYDPEKAK KLLDEAGYKD VDGDGFREDP
NGKKFTINFL SMSGGDIAEP LAKFYMQCWK DVGLNVQLVD GRLAEFNSFH DMVEKDNPKV
DVFAAAWYTG TNVDPYGLYG RDVMFNYSRW VNEKNDELLE KGHSEQAFDK EYRKKIYNEW
QALMNEEVPV IPTLYRSIIY AVNNRVKNFT VDPSSKLTWK DVGVTSEKPE VAQ