Gene Bcer98_3359 details


Gene Information

Locus tag: Bcer98_3359
Symbol:
ID: 5343822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cytotoxicus NVH 391-98
Kingdom: Bacteria
Replicon accession: NC_009674
Strand:
Start bp: 3427995
End bp: 3429068
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 39%
IMG OID: 640840846
Product: glutamyl aminopeptidase
Protein accession: YP_001376569
Protein GI: 152977052
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
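
The start, end, and length fields in this record agree with each other, which a short calculation can confirm. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3427995, 3429068)
print(length_bp)                  # 1074
print(protein_length(length_bp))  # 357
```

Both results match the Gene Length and Protein Length fields listed for Bcer98_3359.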


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAG AGACATTAGA GCTATTTCGT ACATTAACAG AACTACAAGG AGCATCAGGA 
TTTGAGCATG ATGTACGTCG CTTTATGAAG CAAGAATTAA GCAAATATGC AGATGAGATT
GTGCAGGATG GTTTAGGCAG TATATTTGGT CTGAAAAAAG GAGACGAGAG TGGTCCTCGT
GTACTGGTAG CAGGTCATAT GGATGAAGTT GGCTTCATGG TAACACAAAT TACGGAAAAC
GGGATGATTC GTTTTCAAAC ATTAGGTGGT TGGTGGAGTC AAGTATTACT AGCTCAGCGT
GTACAGATTA TGACAAAAAA TGGTCCTATT GTTGGGGTAA TTGGTTCTAT TCCACCGCAT
TTATTAAGTG ACGCGCAGCG TGCAAAACCA ATGGATATTA AGAATATGTT AATTGATATT
GGTGCTGACA GCTATGAAGA AGCACTTGAA ATCGGTGTGA AACCAGGGCA ACAAATTGTT
CCAATTTGTC CGTTTACACC GATGGCAAAT GAGAAGAAAA TTATGGCGAA AGCTTGGGAC
AACCGTTATG GTTGTGGTTT GGCAATTGAA TTATTAAAAG AATTAAAGGA TGAAACTTTG
CCAAATATAT TATATTCTGG TGCAACTGTT CAAGAAGAAG TAGGACTTCG TGGTGCACAA
ACAGCTGCGA ATATGATTCA GCCGGATATT TTCTATGCGC TTGATGCAAG TCCAGCGAAT
GATGCATCTG GTGATAAAGA GCAGTTCGGA CAATTAGGAA AAGGGGCGCT TCTTCGTATT
TATGACCGCA CAATGGTAAC ACATCGCGGG ATGCGTGAAT TTATTTTAGA TACAGCAGAA
ACACATAATA TTCCGTATCA ATATTTTATT TCACAAGGTG GTACAGATGC AGGACGTGTA
CATACAAGCA ACTCCGGTAT TCCATCAGCA GTAATCGGTG TTTGTGCTCG TTATATTCAT
ACACACGCTT CAATTTTACA TGTTGATGAT TATGCGGCGG CAAAAGAATT ATTGATGAAG
CTTGTTAAAG CGACAGATAA AACGACGCTG GAAACAATTA AAAATAGTGC GTAA
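
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be stripped before any statistic such as GC content is computed. The following is a minimal sketch with hypothetical helper names. It is shown on the first two blocks only, and it has not been run here against the full sequence to reproduce the 39% figure in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


first_blocks = clean_sequence("ATGAATAAAG AGACATTAGA")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.25
```

Passing the whole gene sequence through `clean_sequence` and then `gc_fraction` would give the value to compare against the GC content field.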
 
Protein sequence
MNKETLELFR TLTELQGASG FEHDVRRFMK QELSKYADEI VQDGLGSIFG LKKGDESGPR 
VLVAGHMDEV GFMVTQITEN GMIRFQTLGG WWSQVLLAQR VQIMTKNGPI VGVIGSIPPH
LLSDAQRAKP MDIKNMLIDI GADSYEEALE IGVKPGQQIV PICPFTPMAN EKKIMAKAWD
NRYGCGLAIE LLKELKDETL PNILYSGATV QEEVGLRGAQ TAANMIQPDI FYALDASPAN
DASGDKEQFG QLGKGALLRI YDRTMVTHRG MREFILDTAE THNIPYQYFI SQGGTDAGRV
HTSNSGIPSA VIGVCARYIH THASILHVDD YAAAKELLMK LVKATDKTTL ETIKNSA