Gene GWCH70_0297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0297 
Symbol 
ID7979094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp336000 
End bp337127 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content47% 
IMG OID644797290 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_002948490 
Protein GI239825866 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGCT TTGAAAACGT ATGGAAACAA TATGACGACG GTTTTGTCGC ATTGAAAAAT 
ATTAATCTCG AAATTCAAAA AGGAGAATTG GTCACTTTAA TCGGGCCAAG CGGATGCGGA
AAAACGACGA CGATGCGGAT GATTAACCGC CTGACTGAAC CGACGTCTGG AACGATATAC
ATTGACGGGC AGGACATTGC AAAAATGAAT CCAGTGGAAC TACGACGCAA CATTGGCTAT
GTCATCCAGC AAATCGGGCT GTTCCCTCAT ATGACGATTG CGGAAAATAT CGCCTTAGTT
CCGAAACTAA AAAAATGGGA GCCGTCCGCC TATCAAAAAC GGGTTGACGA ACTGCTTGAT
CTTGTCGGAT TAGATCCAGC GATGTTTAAA CATCGCTACC CGTCGGAACT TAGCGGTGGC
CAGCAACAAC GAGTCGGCGT TATTCGCGCC CTTGCCGCAG AGCCTGACAT CATTTTGATG
GACGAGCCGT TCAGCGCGCT CGATCCGATC AGCCGCGAAC AGCTGCAAGA GGATATTGTG
AAATTGCAGG AAGAAATTCG AAAGACAATT GTGTTTGTCA CACATGATAT GGATGAAGCG
ATTAAAATTT CCAACCGTAT TGCAATTATG AAAGACGGAG AAATCGTGCA ATTTGCTACG
CCGGATCAAA TTTTGCGCCG TCCTGTCAAT TCATTCGTAC GCGACTTTAT TGGAGAGAAC
CGCCTTGCAC AAAGACAAAC GGCCGTGCCG ACAGCGGAAG ACTTAATGTC CCATTCCATC
GCTACGATAT CGCCGAAGCG CGGATTAGCC GAGGCCTTCC GGTTCATGAA AGAGAAAAAA
GTAGACAGCT TAATCGTTAC AGATAAAAAA CAATCCTTTC TTGGTGTCGT GACATTAAAA
AAACTAGAAA GACATTATCA GCAGGAACAT CTTCTTGTGA CCGACATCGC TGATTTCGAT
GTGACGACAC TAACAAAGGA TGCTGATGTG ACGGAAGTTG CGGAAATTTT CCAGCAACAA
GATGTCAGCG CCATCCCTGT ATTGGCCGGG AATCGCCTTG TCGGCGTTAT CACGAGATCG
AGCATGATGC GCGGGCTGGC GGAATGGGAG TTTCAAAAGC AACCGTGA
 
Protein sequence
MIRFENVWKQ YDDGFVALKN INLEIQKGEL VTLIGPSGCG KTTTMRMINR LTEPTSGTIY 
IDGQDIAKMN PVELRRNIGY VIQQIGLFPH MTIAENIALV PKLKKWEPSA YQKRVDELLD
LVGLDPAMFK HRYPSELSGG QQQRVGVIRA LAAEPDIILM DEPFSALDPI SREQLQEDIV
KLQEEIRKTI VFVTHDMDEA IKISNRIAIM KDGEIVQFAT PDQILRRPVN SFVRDFIGEN
RLAQRQTAVP TAEDLMSHSI ATISPKRGLA EAFRFMKEKK VDSLIVTDKK QSFLGVVTLK
KLERHYQQEH LLVTDIADFD VTTLTKDADV TEVAEIFQQQ DVSAIPVLAG NRLVGVITRS
SMMRGLAEWE FQKQP