Gene GWCH70_2622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2622 
Symbol 
ID7978285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2657208 
End bp2658233 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content40% 
IMG OID644799423 
Productaliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein 
Protein accessionYP_002950582 
Protein GI239827958 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000404129 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAC ATATATTGTT TTTGCTTATA AGCTTGTTTA TCGCATTGAT GTCTGGCTGC 
GGACAGGAAG CAACGACAAG TACAACAGCG AAAGGAAAAG AAAAAAACAT AACGATTCGC
ATCGGCATTC AGCAAAGCCT TGGACCGCTT TTACTGGCAA AAGAAAAAGG ATGGTTTGAA
AAAGAATTTG CCAAAGAGGG AGTCAACGTC AAATGGATTG AGTTTCAAAG CGGGCCGCCG
CATTTTGAAG CGATGGCATC TAACAATCTT GATTTCGGCG CTGTTGGAAA CTCTCCGGTA
ATTTCAGCCC AAGCCGCTAA CATTCAATTT AAGGAGATTA GTAAAGCAGC AGAAGGATTA
AAAGGAGATG CCATCATTGT GCCGAAAGAA AGCAAAATTC GTAGTTTAAC AGATTTGAAA
GGAAAAAAAA TCGCTGTTGC CAAAGGAAGC AGTGGATTCA ACTTCTTATA TAAAGCCCTC
GAGCATGCCG GCTTGAAAGC GTCAGATGTT GAAATGATTC AATTGCAGCC GGATGAAGCG
CAGGCAGCGT TTGATACACA TAAAGTGGAT GCTTGGGCGA TTTGGGAGCC GTTTATTTCC
TACGAGGTGA TCAAAAATAA AGCACGTATC GTAGCGGATG GAGAGGATCT TCATGCATAT
TCGCCATCGT TTATCGTGGC ACGGACGGGA TTTATCAAAG AGAATCCGGA TTTAACGGTT
CAATTTTTGA AAATTTATGA AAAAGCTCGA CGTTGGCAAA ATGATCATTT TGATGAAGCG
GTGGAAATTT ATGCGAAAGC GAAAAAGCTA GATAAAGATG TCATAGTGCG GGCGTTACGC
AACAACCCAT CATTAAACGA GCCAATTACG GATGATGTTG TTCAAGCACA GCAAAAAACC
GCCGATTTTC AATATGCTCA ACATATCATT AAAACCAAAA TTGATACAAG CAAAGTGGTC
GAAAATCGAT ATATTAAAAA AGCATTACAA GAATTAGAGA AAGAAGGTGA GAACAAACAT
GAATAA
 
Protein sequence
MRKHILFLLI SLFIALMSGC GQEATTSTTA KGKEKNITIR IGIQQSLGPL LLAKEKGWFE 
KEFAKEGVNV KWIEFQSGPP HFEAMASNNL DFGAVGNSPV ISAQAANIQF KEISKAAEGL
KGDAIIVPKE SKIRSLTDLK GKKIAVAKGS SGFNFLYKAL EHAGLKASDV EMIQLQPDEA
QAAFDTHKVD AWAIWEPFIS YEVIKNKARI VADGEDLHAY SPSFIVARTG FIKENPDLTV
QFLKIYEKAR RWQNDHFDEA VEIYAKAKKL DKDVIVRALR NNPSLNEPIT DDVVQAQQKT
ADFQYAQHII KTKIDTSKVV ENRYIKKALQ ELEKEGENKH E