Gene Bcer98_1240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_1240 
SymbolaroB 
ID5345175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp1350061 
End bp1351170 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content38% 
IMG OID640838830 
Product3-dehydroquinate synthase 
Protein accessionYP_001374557 
Protein GI152975040 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGTA TACACATTCA AACAACATCC AAAAAATATG ATGTATATGT TGGCAAACAT 
GTCCTATCCT CTTTAACAGA GGTTGTTCAA CGTATGAAAC CAGCTGTTTC AAATGTTATG
ATCATTTCTG ATGAATCTGT TGCAACCTTG CATTTACAAA AAGTAAAAGA GGCTTTGCAA
ATAAAGCAAG ATGTGTTTTC ATTTGTTATT CCAAGTGGAG AAAAAGAAAA GTCATTTGAA
AATTTCTATG CAGTCCATAC AGCGGCACTT GAGAACAAGC TCGATCGTAA TTCATTAATA
ATTGCACTAG GTGGCGGAAT GATTGGCGAT TTAGCAGGAT TTGTAGCAGC TACATTTATG
CGGGGGATTC GCTTTGTTCA AGTACCCACA ACACTGTTGG CACATGATAG TGCAGTGGGC
GGGAAGGTGG CCATTAACCA TCCTTTAGGG AAAAACATGA TCGGGGCGTT TCATCAGCCA
GAAGCGGTGT TATACCATAC GCCATTTCTA GATTCATTAC CTGAAAAAGA ATGGCGCTCT
GGCTTTGCTG AAGTAATCAA ACACGCTTTA ATCGGAGATG TAGAATTATA TCATTGGTTA
AAAAACAATG TGACAACATT GGCGGATTTA CGAGATGATA AATTAGTTTA TGTATTAAAA
CGTGCAATCC CTGTCAAAGC GAAGATTGTA GCGCAAGATG AGACAGAAAA AGGGGTGCGT
GCACATTTGA ACTTTGGGCA TACATTAGGA CATGCCTTAG AAAAAGAATC GGGATATGGC
AATATCACGC ATGGTGACGG TGTTGCAATC GGCATGTTAT TTGCCATATT TTTAAGTGAA
CAAATGTATA AGATTGACCT CAGGTATAAA GAATTAAAAC AGTGGTTTTT GCAGTATGGT
TACCCGAGCA TACCAAGGCA TTTGAAGGTG GATCGTCTTG TAAATGTTAT GAAACAAGAT
AAAAAAGCAA ATGCTGGAAC AATTCGTATG GTACTTATGC AGGAATATGG GGGCGTACAT
GTCGTATCTA TTTCAGATAA GACCGTTCAC ACTTCATTAG AAGCATTTCA AAAAGATATG
GTACTAGGTG AAGAAATGAA TTTTGAATGA
 
Protein sequence
MESIHIQTTS KKYDVYVGKH VLSSLTEVVQ RMKPAVSNVM IISDESVATL HLQKVKEALQ 
IKQDVFSFVI PSGEKEKSFE NFYAVHTAAL ENKLDRNSLI IALGGGMIGD LAGFVAATFM
RGIRFVQVPT TLLAHDSAVG GKVAINHPLG KNMIGAFHQP EAVLYHTPFL DSLPEKEWRS
GFAEVIKHAL IGDVELYHWL KNNVTTLADL RDDKLVYVLK RAIPVKAKIV AQDETEKGVR
AHLNFGHTLG HALEKESGYG NITHGDGVAI GMLFAIFLSE QMYKIDLRYK ELKQWFLQYG
YPSIPRHLKV DRLVNVMKQD KKANAGTIRM VLMQEYGGVH VVSISDKTVH TSLEAFQKDM
VLGEEMNFE