Gene Haur_4364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4364 
Symbol 
ID5736224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5575793 
End bp5577868 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content53% 
IMG OID641281525 
Productshikimate/quinate 5-dehydrogenase 
Protein accessionYP_001547124 
Protein GI159900877 
COG category[R] General function prediction only 
COG ID[COG5322] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0103685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAG TAGTCTGTAT CACGCTCGGA CGTTCGCGGC GTGACTTTAG CTTTACAACA 
ACGCTGTTGG GCGAAGAGCT ACGGGTGCGC CGCATTGGAG CCGATGGCGA TGTCGAACGG
GTCAAGCAGT TGATTCGTGA GCACGATGGC AAGGTTGATG CGATTGCGCT TGGTGGTGTA
ATTGCCAACT TTCGGGTTGG CAAGGCCAGC TATCAACATA ACCAAGCGTA CACCATCGTC
AATCAAGCAC GAGTTACGCC AACCGCCGAT GGAGTGTTGC TCAAGGCAAC CCTCGAACGT
TGGACGGTGG CGCAGGCGGT TTCGCGTGAA CCAGGCCGCT TCAACTATCG TCGGGTTTTG
GTCTTTTCGG GGATTGAGCG CTATTCTTTA GCTGAATCGC TCAGCGGTTA TAACGTTGAT
TTGCGTTTTG CCGACCCCAA GGTGCATTAT GGTTTGCCCT TCACGTTGAG TTCGCTGAGC
CAACTGGAGC GCTACGCCAA ATTTGCCATG CCCGATTTGG CCAAAAAGCC CTATCGGCGG
ATTCACCCGA TTGGCAAGGG CGCGACCCAC GATAGCCGTC TCGAAAAAGA TTGTGCTTGG
GCCGATGTGT TAGCTGGTGA TTTTGCCTTT ATTCGGCGCT ACGCCCCGCA AGATCTGCGA
GGTCGCACGA TTTTGACCGA CGATCCATCG CCTGCTGAAA TTGAAGATTT GCGCCAGCGT
GGAGCACATA CCTTAATTAC GCTCACGCCC AAAATTAGCG AAGAACATCC GTTTGTTTCG
GCAGATGTGC TCGAAGCTAT GATTTTGGCC GTTACAGGCA AGCGCACGCT TGATGAAGCC
ACGGTATTGC AAATTACCGC CGATGCTAAT TGGGAGCCGC ACATCCAGCG TTTGACCAAC
GACGAAGAAT TAGAAAAATT TGCCTTTGTG ATTCACCCGC TCTCAACCAA ATTTATTTAT
AAAGATCCCC GCTTCAAAGT CTTCAAATTT GTGCCCCAAC GTTGGGTTGA ACGCGCCATG
GCCCACTTGC CACCGCTGTA TCTCTCGCGT ATGAAGGGTA TTAAATCAAC TGGTACAGGC
AAAGAAATCG AAGGCATTTT GCTGACCTTG GGCGCTACGC CCCGCGAATT GATGCGCCGC
CCAACTGCCT TTACCTATCG CCGTTTGATC AAGGCTGCCC GTATGGCCGA GCGCATGGGC
GCGAAGCTGA TGGGCTTGGG GGCATTCACT TCGGTGGTTG GTGATGCTGG CATCACGGTT
GCCCAAAAAT CCGATATTGG CATCACCTCA GGCAACTCGT TGACTGTGGC CGCCACCCTT
GAAGCCGCCA AACAAGCGGT CATTCTTATG GGTGGTCGAG TTGATCAAGG CACGGCAGTG
GTGATTGGGG CAACTGGTTC GATTGGCGCA GTTTGTTCGC GCCTGCTAGC CCAAGCGATT
GGCGATGTGG TTTTGATTGC GCCACGACCT GAGCGTTTGA TCGCCTTGAA AAAGCAAATC
GAGGCTGAAA CGCCCAACGC CAAAGTAACA ATTGCCACCA AAGCTGATGA TTATGTTGGC
AGTGCCGACT TAATTGTTAC CACCACCACC GCCCTCAACA CCAAAATTGT CGATATTGAG
CGCTTGAAGC CAGGTGCGGT GGTGTGTGAT GTGGCACGGC CACCCGATAT CAAAGAAGAT
GAAGCCGCCA AACGCCCTGA TGTGTTGGTG ATTGAATCGG GCGAAATCAC CTTGCCAGGC
GAGGTTGATT TTGGCTTTGA TATTGGTTTG CCGCCAGGTA CAGCCTATGC ATGTCTCTCG
GAGACGGCTT TGTTGGCGCT TGATGGCAAG TTTGAAGATT ACACGCTTGG CCGTAATATC
GAAATGGATC GGGTCAAGGA GATGTATCGC TTGTTCAAAA AGCATGGCCT CAAATTGGCT
GGCCTGCGCA CCTTCGACCA ATATGTAACC CCCGAAATGG TCGCCGAAAA GCGCCGATTG
GCCGATCATC GTCGGCACGA GCTGGGCTTG CCAGTGACCA CCGAAAGCGA AACCCTTACT
AGCGAAATGC CGCTGGAAGT CGGTGGCTCC AACTAA
 
Protein sequence
MKEVVCITLG RSRRDFSFTT TLLGEELRVR RIGADGDVER VKQLIREHDG KVDAIALGGV 
IANFRVGKAS YQHNQAYTIV NQARVTPTAD GVLLKATLER WTVAQAVSRE PGRFNYRRVL
VFSGIERYSL AESLSGYNVD LRFADPKVHY GLPFTLSSLS QLERYAKFAM PDLAKKPYRR
IHPIGKGATH DSRLEKDCAW ADVLAGDFAF IRRYAPQDLR GRTILTDDPS PAEIEDLRQR
GAHTLITLTP KISEEHPFVS ADVLEAMILA VTGKRTLDEA TVLQITADAN WEPHIQRLTN
DEELEKFAFV IHPLSTKFIY KDPRFKVFKF VPQRWVERAM AHLPPLYLSR MKGIKSTGTG
KEIEGILLTL GATPRELMRR PTAFTYRRLI KAARMAERMG AKLMGLGAFT SVVGDAGITV
AQKSDIGITS GNSLTVAATL EAAKQAVILM GGRVDQGTAV VIGATGSIGA VCSRLLAQAI
GDVVLIAPRP ERLIALKKQI EAETPNAKVT IATKADDYVG SADLIVTTTT ALNTKIVDIE
RLKPGAVVCD VARPPDIKED EAAKRPDVLV IESGEITLPG EVDFGFDIGL PPGTAYACLS
ETALLALDGK FEDYTLGRNI EMDRVKEMYR LFKKHGLKLA GLRTFDQYVT PEMVAEKRRL
ADHRRHELGL PVTTESETLT SEMPLEVGGS N