Gene GWCH70_2449 details

Gene Information

Locus tag: GWCH70_2449
Symbol:
ID: 7979007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2480926
End bp: 2483205
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 41%
IMG OID: 644799251
Product: DNA internalization-related competence protein ComEC/Rec2
Protein accession: YP_002950411
Protein GI: 239827787
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
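
The start and end coordinates, gene length, and protein length listed in this record agree with one another, and a short sketch can confirm this. The Python below is an illustration only and is not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for GWCH70_2449.
gene_len = gene_length_bp(2480926, 2483205)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 2280 759
```

Both results match the Gene Length (2280 bp) and Protein Length (759 aa) fields.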


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.327807
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGGGA ATATTGTTTA TATAGCGCTT GCTGCTGTTT TTGGGATCAT CGCTGGGTGC 
GGGAGGCCAA ACATTTCTTT GGTGTTTATC GTGCTGTATG CTATATTTCT TTTCACGCGA
AAAAGGCGCC TCTTTCTTTT TTCTATCATA ACGATTTGCC TTTTTTATAT GTATTTCGTT
TATATCGACC ATCATAATAA GACGAAACTG TCTGGCAACA TGACGTTATT TACTATTCGT
TTTATTGCTC CCATTTCGGT GGATGGGGAT CAACTAAAAT CGATTGTCAA AGTGGGAAAA
GAAAAGCTTC AGCTTATTTA CTACATAAAA TCGGCACAGG AAAAACAAGA GCTTTCTTCT
TTAACTCCTA AAGCAGTATG CACGGTGAAG GGAAGGCTTG AACGTCCGGC GCTACCGCGA
AATCCTAATG CTTTCGATTA TCGTCGTTAT TTACGATTTC ATCATATTCA TTGGATTCTC
AAACCCCAAT CAATCTCCTT GCAAGACTGC CACAACACAT CCCCAACGTT ATATGAATGG
CTTCTTTCGC TTCGTGAAAA AGGGCTGCAC ACGGTAGAAA CACATTTTCC ATCGGAAACG
GTTGGAATTG TCCAAGCGCT TTTATACGGA GAACGAGGGC AATTAGACGA GACGCTTTTG
GAAGGATACC AAAAACTCGG ATTAATCCAT TTATTAGCCA TTTCCGGGTT GCATGTCACG
TTATTAGTCG GAGCCATGTT TTCCATATTG ATTCGCTTTA TCACCAAGGA AACGGCAACG
ATAATCCTAT TGATTTTCCT TCCCATTTAT ATTGTGCTAA CAGGTGCTTC TCCTTCCGTT
ATTCGCGCCT CCTTTACAGC AATGATCTTT TTAGTTTCCG ATTACTGGAA ATCGAAGTTA
TCGCCGCTCG ATGCATTAAG TATAACGCTA GTCATGATGC TGCTTTTCGA TCCGTATATG
TTATGGGATG TCGGATTTCA GTTATCGTTT ATTGTAACAT TTGCTCTTAT TTTATCTTCC
CAAATGATAT TGTCGCATGA TTCTGTCTTT TGGCGCCTCT TTTTTACGGC TTTTATTGCC
CAGTTGAGCG CCCTTCCGTT TCTTTTATAC TATTTTTTCG AAGTTTCGTT ATGGAGCATT
CCCCTAAATA TTATTTTTGT GCCGCTTTAT TCCTTCGCCA TTATGCCGCT TTCGTTTGCA
GCTGTAGGCA CGCATTATAT ACATTCGCTT CTTTCTTTTC CTTTTATTTG GTTGTTACAA
AAAATTATCG TAATCTCTAG CGATGTTGTT GCATTTTTCT CATCTAACCA TTCGTTGTCA
CTCGTTCTAG GCCGGCCTTC TTTCTTTTTT CTTATCTGTT ATGCGGCCGC GATTTTTGCT
GCATTTGTAC AAATGGAGAA GCGGCGTTAC TTTAGTATAG GATGGGTCGC CTTCGTTATT
ACGCTTCATG CATGGAGCCC ATATATGGAT CGTTACGGTG AGGTCATTTT ACTCGATGTC
GGGCAAGGGG ATTGTATATA CATTGAATTG CCGTATCGAA AAGGAGTATA TCTTATTGAC
ACAGGAGGAA CACTTCCACA CCAAAGACAG CCATGGCAGG AACGAAAGCG GAAGTGGGAT
GTAGGAAAAG ACGTTGTCGT TCCATTTTTA AAATCAAACG GAGTGAGATA TATCGATAAA
CTTATTGCGA CACATGGAGA TTTGGACCAT ATTGGAGCGG CAGAAGAAAT CATTCGGCAT
TTTTCTGTCA AGCAAATGGT CATTGGCAAA GGAGAAACAA AAAATTCGAT ACAAGAAAAA
CTTGTTCAAC TTGCTGAACG ACAAAATATA GAAGTGGTGA AAATATCAAG GGGTGACAGA
TGGATAGAGG ATGGAATTTC ATTTTATGTA CTTCATCCAT GGAAAACGCA TCGTGATGAT
AATAACCATT CGATTGTATT GTATACAAAG CTCGGTGGAT TATCCTGGTT ATTTACGGGT
GATTTAGAGG AAACGGGGGA AAGGGAGTTA ATCAGCGTTT TTCCGCGCTT AGAAGCCGAT
GTGTTAAAAG TGGCCCATCA TGGGAGCGAT ACGTCTACAA CAGAGTTATT TTTAGAAAAA
GTACAGCCGA AGATTGCACT TATTTCCGTC GGCAAACATA ACCGTTACCA TCATCCTTCC
CCATATGTTA TCGAACGGCT CCGGAAAAGG AATGTCATTA TTTTGCGGAC AGACCAGCAT
GGCGCGATTC GCTATATTTA TTCGAAAAAT CGTGGAACCT TTTCCGTCAT GCTGCCATAG
 
Protein sequence
MRGNIVYIAL AAVFGIIAGC GRPNISLVFI VLYAIFLFTR KRRLFLFSII TICLFYMYFV 
YIDHHNKTKL SGNMTLFTIR FIAPISVDGD QLKSIVKVGK EKLQLIYYIK SAQEKQELSS
LTPKAVCTVK GRLERPALPR NPNAFDYRRY LRFHHIHWIL KPQSISLQDC HNTSPTLYEW
LLSLREKGLH TVETHFPSET VGIVQALLYG ERGQLDETLL EGYQKLGLIH LLAISGLHVT
LLVGAMFSIL IRFITKETAT IILLIFLPIY IVLTGASPSV IRASFTAMIF LVSDYWKSKL
SPLDALSITL VMMLLFDPYM LWDVGFQLSF IVTFALILSS QMILSHDSVF WRLFFTAFIA
QLSALPFLLY YFFEVSLWSI PLNIIFVPLY SFAIMPLSFA AVGTHYIHSL LSFPFIWLLQ
KIIVISSDVV AFFSSNHSLS LVLGRPSFFF LICYAAAIFA AFVQMEKRRY FSIGWVAFVI
TLHAWSPYMD RYGEVILLDV GQGDCIYIEL PYRKGVYLID TGGTLPHQRQ PWQERKRKWD
VGKDVVVPFL KSNGVRYIDK LIATHGDLDH IGAAEEIIRH FSVKQMVIGK GETKNSIQEK
LVQLAERQNI EVVKISRGDR WIEDGISFYV LHPWKTHRDD NNHSIVLYTK LGGLSWLFTG
DLEETGEREL ISVFPRLEAD VLKVAHHGSD TSTTELFLEK VQPKIALISV GKHNRYHHPS
PYVIERLRKR NVIILRTDQH GAIRYIYSKN RGTFSVMLP