Gene GWCH70_2745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2745 
Symbol 
ID7979148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2782652 
End bp2783728 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content48% 
IMG OID644799542 
ProductGlutamyl aminopeptidase 
Protein accessionYP_002950701 
Protein GI239828077 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000840882 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTAG AAACGCTTCG ACTGTTTCAA ACATTGACCG AATTGCCGGG AGCACCAGGC 
AATGAACATG CGGTGCGGAA TTTCATGCGC AAAGAGCTGG AAAAGTACGC GGATGAAGTC
GTACAAGACC GGCTTGGCAG CATCTTTGGC GTCAAACGCG GCGATGAAAA CGGTCCTACC
GTGATGGTTG CAGGGCATAT GGACGAAGTC GGCTTTATGG TCACCGCTAT TACCAATAAC
GGTATGATTC GTTTTCAGCC GCTTGGCGGC TGGTGGAATC AAGTATTGTT AGCACAGCGC
GTACAAATCA TTACCGATCA TGGTCCAGTT GTTGGAGTGA TCAGCTCGAT TCCGCCGCAT
TTGTTGAGCG AAGAACAACG AAACAAGCCG ATGGAGATCA AAAACATGCT CATCGACGTT
GGTGCTGATG ACCGTGAAGA CGCGAAAAAA ATGGGGATTA AACCAGGACA ACAAATTGTA
CCGATTTGTC CATTTACCCC AATGGCCAAT CCGAAAAAAA TTTTGGCAAA AGCGTGGGAC
AATCGTTATG GCTGTGGATT GGCGATTGAA TTGTTGAAAG AGTTGAAGGA TGAGAAACTG
CCAAACGTGC TATATTCAGG TGCTACTGTC CAAGAAGAAG TCGGGTTGCG CGGGGCGCAA
ACCGCCGCAA CCATGATTCA GCCTGATATC TTTTTCGCGT TAGACGCAAG CCCGGCGAAC
GATATGACCG GAGACGCGAA AGAATTTGGG CATCTTGGAA AAGGGGCGCT TGTCCGCATT
TATGACCGTT CGATGGTGAC TCATCGCGGT ATGCGTGAAT TTGTATTAGA TACGGCGGAA
ACGCACGGCA TTCCATATCA ATTTTTCGTT TCACCGGGTG GAGGAACGGA TGCCGGAAGA
GTGCATATCG CCAACAGTGG AGTGCCTTCT GCCGTGATCG GTATTTGTTC CCGTTATATT
CATACACATG CGTCGATCAT TCACGTCGAT GATTATCAGG CAGCAAAGCA ACTGCTTATT
GAACTTGTAA AGCGGTGCGA TAAAGCAACA GTCGATGCAA TTAAGAAAAA CAGCTAA
 
Protein sequence
MNVETLRLFQ TLTELPGAPG NEHAVRNFMR KELEKYADEV VQDRLGSIFG VKRGDENGPT 
VMVAGHMDEV GFMVTAITNN GMIRFQPLGG WWNQVLLAQR VQIITDHGPV VGVISSIPPH
LLSEEQRNKP MEIKNMLIDV GADDREDAKK MGIKPGQQIV PICPFTPMAN PKKILAKAWD
NRYGCGLAIE LLKELKDEKL PNVLYSGATV QEEVGLRGAQ TAATMIQPDI FFALDASPAN
DMTGDAKEFG HLGKGALVRI YDRSMVTHRG MREFVLDTAE THGIPYQFFV SPGGGTDAGR
VHIANSGVPS AVIGICSRYI HTHASIIHVD DYQAAKQLLI ELVKRCDKAT VDAIKKNS