Gene GWCH70_0304 details


Gene Information

Locus tag: GWCH70_0304
Symbol:
ID: 7977423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 347506
End bp: 348543
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 44%
IMG OID: 644797297
Product: peptidase S58 DmpA
Protein accession: YP_002948497
Protein GI: 239825873
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3191] L-aminopeptidase/D-esterase
TIGRFAM ID:
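
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(347506, 348543)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the computed lengths agree with the listed 1038 bp and 345 aa.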


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAA AGATTCGGGA ATTAGGCATT GAAATTGGAT CGCTGGAAAC GGGAAAGCAT 
AATCAGATCA CTGATGTTCC TGGAGTTAAG GTAGGACATG TAACATTAAA GAAAGAGTTG
GATGAAAAGA CGGTCATTCG AACAGGAGTT ACGGCTATTC TTCCTCATGG AGGCAATATT
TTTTTGCAAA AGGTGCCTGC TTCGTGTTTT GTTTTAAATG GATTTGGGAA AACAGCGGGA
TTAGTGCAAG TGGAAGAGTT AGGGGTTATG GAATCACCGA TTATGCTTAC GAATACGTTC
AGTGTGGGCA CAGTCTGGCA AGGAACATTG GAATATTTAA TGGAACAAAA TCAAGAAATA
GGGGATACGA CTAGTTCAGT GAATATCGTA GTAGGTGAAT GCAATGACAG CTATCTAAAC
ACGGTTCATT TTCCTGTCAT CGAAAAAGAA CATGCAAAAC TAGCGATAGA GCAGGCTGTT
TTTGATGTCG AAGAGGGTGC GGTGGGTGCA GGCACTGGTA CGATGTGTTT CGGCTATAAA
GGGGGAATTG GCAGTTCTTC GCGAATCATC CATGGAGGGA TTTACACCGT TGGCGCATTG
GTTCTTAGCA ATTTTGGCAA AAGGGAAGAG TTGTGCATTG CACAGTATCG AAAACCATCA
TTTGATGAAA CAGAAATTCC GAATGGTTCT ATTATGATTA TTGTCGCAAC CAACGCTCCT
TTGAGCTCTC GTCAATTGAA GCGGCTGGCA AAACGTGCAG CCTTCGGGCT CGCTCGAACA
GGAAGCCATA TTCACCATGG AAGCGGAGAT ATCATCATCG CATTTTCGAA CGGATACACT
ATCCCCCACT TTTCTGAGTC ATCTTATTAT CAACTTCCGC CGCTCATTCG CGATGATGAT
CCATTGATGA ATGAGCTGTT TCAAGCAGCC ATCGAGTCAA CGGAGGAAGC CATCTTAAAT
TCGTTGACAA TGGCAGAGAC GACAACCGGA CGGAACGGGC GAATTGGTGA GGCTATTCCG
TATGACCTTT TTCAATGA
 
Protein sequence
MRKKIRELGI EIGSLETGKH NQITDVPGVK VGHVTLKKEL DEKTVIRTGV TAILPHGGNI 
FLQKVPASCF VLNGFGKTAG LVQVEELGVM ESPIMLTNTF SVGTVWQGTL EYLMEQNQEI
GDTTSSVNIV VGECNDSYLN TVHFPVIEKE HAKLAIEQAV FDVEEGAVGA GTGTMCFGYK
GGIGSSSRII HGGIYTVGAL VLSNFGKREE LCIAQYRKPS FDETEIPNGS IMIIVATNAP
LSSRQLKRLA KRAAFGLART GSHIHHGSGD IIIAFSNGYT IPHFSESSYY QLPPLIRDDD
PLMNELFQAA IESTEEAILN SLTMAETTTG RNGRIGEAIP YDLFQ
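
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch using only the Python standard library, not the method the database itself uses. It assumes that the sense-codon assignments of translation table 11 match the standard genetic code, and it does not handle alternative start codons (this gene starts with ATG). The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# NCBI-style compact encoding of the codon table, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGAAAAA AGATTCGGGA ATTAGGCATT"))
```

Passing the full gene sequence to `translate` should give the listed protein sequence, and `gc_percent` can be compared against the GC content field.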