Gene Haur_1553 details


Gene Information

Locus tag: Haur_1553
Symbol: aroB
ID: 5733440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1804427
End bp: 1805584
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 52%
IMG OID: 641278692
Product: 3-dehydroquinate synthase
Protein accession: YP_001544324
Protein GI: 159898077
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID:
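
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
START_BP = 1804427
END_BP = 1805584


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for a CDS of n_bp bases, assuming one trailing stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```

Both results match the Gene Length and Protein Length fields of the record.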


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTATGC GTTCGATCCT GCAAGCGTTT GAAGTACGCT ATTCCTATCC GGTGCATTGT 
ACGCACCGAT TATTTGGCTT GGATAACCCA ATTCTCCATG AATTATTTGC GCCAAGCGCT
AGTCTACCGA AGCTTTGGGT TGTGCTTGAT CAGGCGGTGG CTGAGCATCA CCCCAATCTA
TTAACTGAAA TTGCAGCCTA TGCCCAAGCC AGCCAAGCCT TCAGCTTGGT TGAGCCAAGC
CTGATTTTGG CTGGTGGCGA AGCAATTAAG CAGACTACTG AGCCATTGCA AGCAGTCTAC
GATGGAATTA ATCGCTATGC GATCGATCGC CATTCGTATC TGATGGCAAT TGGCGGCGGG
GCGTTGATCG ATATGGTTGG CTATGCAGCG GCGACGGCGC ATCGCGGGGT GCGCTTAATT
CGCGTGCCAA CCACAGTTTT GGCCCAAAAC GATGCAGCGG TTGGGGTTAA AAATAGCATC
AATGCCTTTG GCAAAAAGAA TTTCTTGGGC ACATTTGCTC CGCCTTATGC TGTGCTCAAC
GATAGCCATT TTCTCACGAC GCTGAGTGAG CGCGATTGGC GCAGTGGCAT CGCCGAGGCA
ATCAAGGTAG CTTTGCTCAA AGATCCCGCC TTTTTTGCCA CGATTGAGCG TACTGCTGCG
GCCTTGCGTC AGCGTGATTT AGCCGTAATG GAAGATCAGG TTTTTCGCTG TGCCGAGCTG
CATTTGGCCC ACATCGCTGG TGGCGACCCG TTTGAGCGAG GCTCAGCGCG GCCCTTGGAT
TTTGGCCATT GGGCGGCGCA TAAACTTGAA CAGCTCAGCA ATTATAGTTT GCGCCATGGC
GAGGCGGTGG CAATTGGCAT CGCCTTGGAT TGCACCTACA GCTATTTAAA CGCTGATTTA
GCCGAGGCCG ATTGGCAACG GGTCTTGACT TGTCTAACCG CAGTTGGCTT CGAACTCTAT
CATCCTGCCC TGAGCAACCA GCTTGAACTG CCCGAACATC CCCAAAGTTT GCTGAGCGGC
TTGGCCGAAT TTCGCGAGCA CCTTGGTGGC CAATTAACCA TCACCTTAAT GCGCGGCATC
GGCCAACCCT ACGATGTTCA CACAATTGAT CTACCGATGA TGCAACAAGC CATTCGATAT
TTAGCCGAAC GGGCTTAA
 
Protein sequence
MVMRSILQAF EVRYSYPVHC THRLFGLDNP ILHELFAPSA SLPKLWVVLD QAVAEHHPNL 
LTEIAAYAQA SQAFSLVEPS LILAGGEAIK QTTEPLQAVY DGINRYAIDR HSYLMAIGGG
ALIDMVGYAA ATAHRGVRLI RVPTTVLAQN DAAVGVKNSI NAFGKKNFLG TFAPPYAVLN
DSHFLTTLSE RDWRSGIAEA IKVALLKDPA FFATIERTAA ALRQRDLAVM EDQVFRCAEL
HLAHIAGGDP FERGSARPLD FGHWAAHKLE QLSNYSLRHG EAVAIGIALD CTYSYLNADL
AEADWQRVLT CLTAVGFELY HPALSNQLEL PEHPQSLLSG LAEFREHLGG QLTITLMRGI
GQPYDVHTID LPMMQQAIRY LAERA
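
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard NCBI ordering (the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code), strips the spacing used in the listing, and stops at the first stop codon. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the blanks and line breaks used in the listing; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGGTTATGC GTTCGATCCT GCAAGCGTTT GAAGTACGCT ATTCCTATCC GGTGCATTGT"
last_line = "TTAGCCGAAC GGGCTTAA"

print(translate(first_line))  # MVMRSILQAFEVRYSYPVHC
print(translate(last_line))   # LAERA
```

The two printed fragments agree with the start and end of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.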