Gene HS_1114 details


Gene Information

Locus tag: HS_1114
Symbol: aroB
ID: 4240615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1250334
End bp: 1251422
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 38%
IMG OID: 638104677
Product: 3-dehydroquinate synthase
Protein accession: YP_719326
Protein GI: 113461257
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
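
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
START_BP = 1250334
END_BP = 1251422


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = coding_codons(gene_length)
print(gene_length, protein_length)  # 1089 362
```

Under these assumptions the computed values agree with the Gene Length (1089 bp) and Protein Length (362 aa) fields.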


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0407056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTGTG TAAATGTAGA ACTAAGGGAA CGCAGTTATC CTATCCATAT CGGAATGGGG 
TTGTTATCCG AAGCACAAGT TTATCCGCTA AAAAAAGGCG ATAAAGTGAT GATTGTAACT
AACCCTACAA TTGCACAGTA TTATTTATCC TCTGTAACAG ACACCTTAGA AAAAATTGGT
TGCTCGGTAG AGAATGTGCA ACTCCCAGAA GGTGAACAAT ACAAAACTTT AGAATCTCTA
GACTTAATTT TTACCGCACT TTTAAAAGCT AATCATGGAC GAGATACCTC TATTATTGCA
CTGGGTGGCG GTGTGATCGG TGATATTGCC GGATATGCAG CGGCAAGCTA TCAACGTGGT
GTCCGTTTTA TTCAAATTCC AACCACATTA CTTGCTCAAG TAGATTCTTC CGTTGGGGGA
AAAACGGCTG TCAATCACAA ATTAGGAAAA AACATGATCG GTGCTTTTTA TCAACCTTGT
GCCGTTATTA TTGATACGCT AACGCTAACT ACTTTACCTA AAAGGGAAAT TCACGCAGGT
TTAGCTGAAG TCATTAAATA TGGTGCTATT TTAGATGATG AATTTTTTAC ATGGCTAGAA
AAACATATAA CTAATTTAGT TGCTTTAGAA CAACAATATT TACAGCAGTG CATTGCTCGC
TGTTGTCAAA TTAAAGCAGA TGTGGTTACT CGTGATGAAA CTGAAAAAGG AGAGCGTGCG
TTATTAAATT TAGGTCATAC TTTCGGGCAT GCTATCGAAA CTCACCTTGG ATATGGAAAT
TGGTTACATG GAGAAGCAGT CGCAACCGGA ATGATGATAG CTGCGATCTT GTCTAATAAA
TTAGGTGATT TATCACTTAA CGATGTAACG AGACTGGAGA AACTTTTAAT TCAAGCAGAT
TTACCTACAG CCTCACCTGA TACAATGAAA GCTGAAGATT ATCTACCACA TATGATGCGT
GATAAAAAGG TTCTTGCTGG AAAATTACGC CTAGTCTTAC TAAAATCACT TGGTCAAGCC
TACGTTGCAA CAGATACAGA CAAAGAATAC GTGCTTGATG CAATTCGTAC CTGTTCAAAA
AAAAGTTAA
 
Protein sequence
MLCVNVELRE RSYPIHIGMG LLSEAQVYPL KKGDKVMIVT NPTIAQYYLS SVTDTLEKIG 
CSVENVQLPE GEQYKTLESL DLIFTALLKA NHGRDTSIIA LGGGVIGDIA GYAAASYQRG
VRFIQIPTTL LAQVDSSVGG KTAVNHKLGK NMIGAFYQPC AVIIDTLTLT TLPKREIHAG
LAEVIKYGAI LDDEFFTWLE KHITNLVALE QQYLQQCIAR CCQIKADVVT RDETEKGERA
LLNLGHTFGH AIETHLGYGN WLHGEAVATG MMIAAILSNK LGDLSLNDVT RLEKLLIQAD
LPTASPDTMK AEDYLPHMMR DKKVLAGKLR LVLLKSLGQA YVATDTDKEY VLDAIRTCSK
KS
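
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch, not the database's own pipeline: it uses the standard codon assignments, which NCBI translation table 11 shares for sense and stop codons (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI's compact layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGCTGTGTG TAAATGTAGA ACTAAGGGAA"))  # MLCVNVELRE
print(translate("AAAAGTTAA"))                         # KS
```

The two fragments reproduce the first ten and last two residues of the listed protein. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.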