Gene GWCH70_2065 details

Gene Information

Locus tag: GWCH70_2065
Symbol:
ID: 7977300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2124397
End bp: 2125638
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 46%
IMG OID: 644798880
Product: peptidase M29 aminopeptidase II
Protein accession: YP_002950050
Protein GI: 239827426
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
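
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither assumption is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2124397, 2125638)
print(length_bp)                  # 1242
print(protein_length(length_bp))  # 413
```

Both values match the Gene Length and Protein Length fields of the record.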


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.610998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGACT GGGAACAAAA TTTAGAAAAA TATGCCGCGC TCGCCGTACA AGTCGGCGTC 
AACGTCCAAA AAGGGCAAAC GCTCTTTGTA AACGCGCCGC TTGTGGCTGC TCCGCTTGTA
CGGAAAATCG CGAAAAAAGC ATACGAAGTT GGCGCCAAAC ATGTCTATGT CGAATGGAAT
GACGAAGATC TTACATACAT TAAATTCAAG TATGCTCCAG ATGAAGCGTT TTTGGAATAT
CCAATGTGGC GTGCGAAAGG AATGGAACAG CTTGCGGAAG AAGGTGCGGC GTTTTTATCC
ATCTATTCGC CAAACCCGGA TTTATTGAAA GACATTGATC CGAAACGGAT CGCAACGGCA
AATAAAACCG CATCTCAAGC ATTGCGCAAC TATCGCAGTG CCTTAATGGC GGATAAAAAC
TGTTGGTCAT TGATCTCCGT TCCTACGCCG GCGTGGGCGA AAAAAATATT TCCAGACCTC
AGTGAAGAAG AAGCGATCGA CAAGCTATGG GAAGCGATAT TCCGCATTAC CCGCGTCGAC
CAGGACGACC CTATCCAAGC GTGGCAGCAA CATAACGACC GACTCGCCAA AATCGTTGAT
TACTTAAACA ATAAACAATA TCAACAACTC ATCTATGAAG CGCCTGGAAC AAACTTAACG
ATCGAACTTG TAGAAAACCA TGTATGGCAT GGCGGTGCAG CCGTCAGCGA GAAAGGCGTT
CGTTTCAACC CGAACATCCC GACAGAAGAA GTGTTTACGA TGCCGCATAA AGACGGGGTC
AATGGCAAGG TACGCAATAC CAAACCGCTC AATTATAACG GCAACCTGAT AGATGGATTT
ACCCTTACGT TTAAAGATGG AAAAGTCGTT GACTTCACTG CAGAAAAAGG ATATGAAATA
TTAAAGCATT TATTGGACAC GGACGAAGGT GCGCGCCGAT TAGGAGAAGT GGCACTCGTT
CCACATCAAT CACCGATTTC CACATCGAAT TTAATTTTCT ATAACACATT ATTCGATGAA
AACGCCGCAT GTCACCTAGC GCTCGGAAAA GCATACCCAA CGAACATTCA AAACGGCACC
GCTATGTCCA AAGAAGAGCT CGACAAACAT GGGGTCAACG ATAGCCTCAT CCATGAAGAT
TTTATGATCG GCTCTGCCGA ACTAAACATC GATGGCGTCA CGAAAGACGG CAAGCGCGAA
CCAATTTTCC GCAACGGAAA CTGGGCGTTT GAATGGAAAT AA
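
A GC content figure such as the one reported in the Gene Information table can be recomputed from a sequence laid out in 10-base blocks like the one above. The following is a minimal sketch; `gc_content` is a hypothetical helper, not part of any tool that produced this record, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage on the first row of the gene sequence only (not the whole gene).
first_row = "GTGAGCGACT GGGAACAAAA TTTAGAAAAA TATGCCGCGC TCGCCGTACA AGTCGGCGTC"
print(f"{gc_content(first_row):.1f}%")
```

Pasting the full gene sequence into the same function gives the value for the whole gene, which can then be compared with the reported GC content.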
 
Protein sequence
MSDWEQNLEK YAALAVQVGV NVQKGQTLFV NAPLVAAPLV RKIAKKAYEV GAKHVYVEWN 
DEDLTYIKFK YAPDEAFLEY PMWRAKGMEQ LAEEGAAFLS IYSPNPDLLK DIDPKRIATA
NKTASQALRN YRSALMADKN CWSLISVPTP AWAKKIFPDL SEEEAIDKLW EAIFRITRVD
QDDPIQAWQQ HNDRLAKIVD YLNNKQYQQL IYEAPGTNLT IELVENHVWH GGAAVSEKGV
RFNPNIPTEE VFTMPHKDGV NGKVRNTKPL NYNGNLIDGF TLTFKDGKVV DFTAEKGYEI
LKHLLDTDEG ARRLGEVALV PHQSPISTSN LIFYNTLFDE NAACHLALGK AYPTNIQNGT
AMSKEELDKH GVNDSLIHED FMIGSAELNI DGVTKDGKRE PIFRNGNWAF EWK