Gene Bcer98_3076 details


Gene Information

Locus tag: Bcer98_3076
Symbol:
ID: 5343886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cytotoxicus NVH 391-98
Kingdom: Bacteria
Replicon accession: NC_009674
Strand:
Start bp: 3120103
End bp: 3121860
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 38%
IMG OID: 640840570
Product: phenylalanine 4-monooxygenase
Protein accession: YP_001376295
Protein GI: 152976778
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3186] Phenylalanine-4-hydroxylase
TIGRFAM ID:
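
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. The helper names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3120103, 3121860)
print(length, protein_length_aa(length))
```

Under these assumptions the coordinates reproduce the listed gene length of 1758 bp and protein length of 585 aa.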


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGA CAGAAATTCC AGCGCATTTA AAGCCTTTTG TGTCTAAGCA ACATTATGAT 
CAGTACACAC CCATTAATCA TGCTGTATGG CGCTATATTA TGAGGCAAAA CCATAACTTT
CTAAAAGATG TGGCTCATCC AGCTTATGTG AACGGATTAA AATCATCTGG TATTAATATA
GACGCAATTC CAAAAGTGGA AGAAATGAAT GAATGTTTAG CACCAAGCGG TTGGGGAGCT
GTAACGATTG ATGGTCTGAT TCCCGGAGTC GCATTTTTTG ACTTTCAAGG TCATGGTTTA
CTACCAATCG CAACAGATAT TCGCAAAGTA GAAAATATTG AATATACACC AGCACCTGAT
ATCGTTCATG AAGCAGCAGG TCATGCTCCC ATCTTACTTG ATCCTACATA TGCCAAGTAT
GTAAAACGCT TTGGACAAAT TGGAGCAAAA GCTTTCTCAA CAAAGGAAGA GCATGATGCC
TTTGAAGCAG TTCGTACACT CACCATTGTA AAAGAAAGTC CAACTTCAAC ACCTGAAGAA
ATTGAAGCAG CGGAAAAAGA AGTAATCGAA AAACAAAAAC TAGTTTCAGG TGTATCAGAA
GCGGAACAAA TTTCTCGCCT TTTTTGGTGG ACAGTTGAAT ATGGACTGAT TGGCGATCTA
GACAATCCTA AAATTTACGG AGCTGGTCTA CTGTCTTCTG TAGGCGAAAG TAAGTATTGT
TTAACAGATG CTGTTGAAAA AGTTCCATTC TCGCTCGAAG CTTGCATAAA GACAACATAC
GATGTGACGA AAATGCAGCC GCAATTATTT GTCTGTCAGT CATTTGAAGA ACTGATAGAA
GCACTTGAAG CATTTTCTAA AACAATGGCT TTTCAGACGG GTGGTGCGGA AGGATTAGAA
AAAGCAATTC GCTCTGAAAA CATAGCGACT GCTGAACTAA GTAGTGGTTT ACAAATTACA
GGTACATTCT CAGAGATGGT GCAAAATGAG GTTGGTGAAG TAATTTATCT AAAAACCAAT
ACACCAACCG CTTTAGCATT CAATCATAAG CAACTGCCTC ATCACTCAAC AGCTATACAC
GAAGATGGAT TTGGTACACC AATTGGTTTA TTGCAAAACA ATATAGCATT AGAAGATTGT
ACAGAGGAAT CTTTACAATC ATTAGGTATT CTAATTGGAA ACAATACTGA TCTTTCCTTT
GCAAGCGGTG TTCACGTAAA AGGAACTGTA ACTGATATTA TAAAACAGGA TGAGAAAGTC
GTTCTTATTT CCTTTACAAA TTGCACTGTT GTTTATAAAG ATCGCTTATT ATTTGATGCT
TCATGGGGAA CATTTGATAT GGCAGTTGGT TCTAACATTA CATCTGTATT CCCAGGTGCA
GCCGATGCAG CCTCATTCTT CCCCATGGAT GAAGAAATAG AAAAAACCCC CGCACCACTT
TCACTATCAG AGCTAGATCG TATGTATCAA ATGGTTCGAG ATATTCGAAA TAAAGGTGAG
CTGCAAGATT CAGATGTAGC ACAATTAGTA GCCATACATG AAGTATTAAA TCAATTCTAT
AAAAAAGAAT GGCTACTCCG CCTTGAAATA TTAGAGTTAC TTGTGGAACA TAACAAAGAT
CAAAAAACAG CCTCTTTCTT ACTGCAACAA CTCTCTACAT TTACAGAAAA TGAGTCTGTA
CAACGTTTAA TCCATAATGG ACTTGCTTTA CTTCCAATAA AGGATGTGAA AAATAATGCA
ACGATTAACA GATCATGA
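
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above, so whitespace has to be stripped first. The function name is hypothetical, and how the source database rounds the percentage is not stated.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, as an example input.
first_line = "ATGAAGAAGA CAGAAATTCC AGCGCATTTA AAGCCTTTTG TGTCTAAGCA ACATTATGAT"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of the first line gives the value to compare with the GC content field.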
 
Protein sequence
MKKTEIPAHL KPFVSKQHYD QYTPINHAVW RYIMRQNHNF LKDVAHPAYV NGLKSSGINI 
DAIPKVEEMN ECLAPSGWGA VTIDGLIPGV AFFDFQGHGL LPIATDIRKV ENIEYTPAPD
IVHEAAGHAP ILLDPTYAKY VKRFGQIGAK AFSTKEEHDA FEAVRTLTIV KESPTSTPEE
IEAAEKEVIE KQKLVSGVSE AEQISRLFWW TVEYGLIGDL DNPKIYGAGL LSSVGESKYC
LTDAVEKVPF SLEACIKTTY DVTKMQPQLF VCQSFEELIE ALEAFSKTMA FQTGGAEGLE
KAIRSENIAT AELSSGLQIT GTFSEMVQNE VGEVIYLKTN TPTALAFNHK QLPHHSTAIH
EDGFGTPIGL LQNNIALEDC TEESLQSLGI LIGNNTDLSF ASGVHVKGTV TDIIKQDEKV
VLISFTNCTV VYKDRLLFDA SWGTFDMAVG SNITSVFPGA ADAASFFPMD EEIEKTPAPL
SLSELDRMYQ MVRDIRNKGE LQDSDVAQLV AIHEVLNQFY KKEWLLRLEI LELLVEHNKD
QKTASFLLQQ LSTFTENESV QRLIHNGLAL LPIKDVKNNA TINRS
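
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, assuming the NCBI codon ordering over the bases T, C, A, G. Table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, which this sketch does not model. The names used are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAGAAGA CAGAAATTCC AGCGCATTTA"))  # MKKTEIPAHL
```

The output matches the first ten residues of the listed protein sequence. The final codons of the gene, ACGATTAACA GATCATGA, translate to TINRS followed by the TGA stop codon.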