Gene Haur_3166 details

Gene Information

Locus tag: Haur_3166
Symbol:
ID: 5735038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3998559
End bp: 3999815
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 50%
IMG OID: 641280309
Product: dihydropteroate synthase
Protein accession: YP_001545931
Protein GI: 159899684
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0294] Dihydropteroate synthase and related enzymes
TIGRFAM ID: [TIGR01496] dihydropteroate synthase
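
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one stop codon; the record does not state either convention, so both are assumptions. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3998559, 3999815)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Under these assumptions the computed values agree with the listed gene length (1257 bp) and protein length (418 aa).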


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00152005
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGAAT TTGCCTATAA CATCCGTCAA TATTATCCTC AGCAACGCCA AGAACTGTTG 
CAGATGATCG CCCAAATTGA AACCTATCCC AATGCTGCTG AGCGCGTATT GACCAAAGCC
AACTTGATTA TGCTGCATTG CGATCAGGTT GATCCACATA CGGCCATGAT TGTGAAGCAA
GAATTATTGG CGCTCGATGG CGATGCCTTG GTTAGCCCAC ACGTCTATCT GGGCCAAAGC
ACGCAGCCAA CCAATCTGCT GGCGTGGGCC AATGAACGCT CGTGGCGAGC CTTATGTGGC
AAGTTTCAAG CGATTCCTTT ACCTGCCTTG CAAGCCTTAG CCCAACAAAT TGGCGCGTTG
CTGTTGCATA ATCAAGCACG AGGCAGCCTC AAACTTGGCG CAACTCAATG GGATTGGGGT
CGAAAAACCT TGGTGATGGG CATTGTCAAT GTTACACCCG ATTCTTTCTC TAACGATGGA
TTGCTTGAGG TCGGAACCAG CCAGATTCAG CAGCAAGCGC TTGAGTTTGC CGCAGCTGGA
GCCGATATTT TGGATGTTGG CGGTGAATCG ACGCGGCCTG GAGCCAGCAG TGTCAGTATC
GCACAGGAGA TTGCGCGGGT TGTGCCAGCG ATTCAAGCGA TTCGTCACGT TTGCCAATTG
CCAATTTCGA TTGATAGCTA CAAAGCTGAG GTTGTGGCGG CTGCGCTTGA AGCTGGTGCA
AATGTGGTTA ATGATATTTG GGGTTTGCGC CAAGCCGATG GTAGTTGGAA TACGGCACTG
GCGCAGTTGG TGGCACAAGC AAACGTGCCA ATTATTTTGA TGCACAATCG AGTCAGCACG
GTTGAGCAAT TTGCCCATGG CACAAATTAC GCTGCTAGCG ACTATGGCGA TATTATCGGC
GAAGTTTGTG CCGAATTACG CCAAAGCATC GATTTCGCCC TGCAAGCGGG CATTGCCAAC
GATTTAATTG TGCTTGATCC AGGCATTGGT TTCGGCAAAA GCCCTGAACA AAATCTACAA
GTATTACGTC AACTACGGAC AATTGCAAGC TTAGGCTACC CGTTGTTGGT TGGCACTAGC
CGAAAATCCA TGATTGGGAT AACATTAAAC CGACCTGTTG ATCAACGCCT GTGGGGCACA
GCCGCCACCG TGGCCTATGC AATTCAGGCA GGAGCCGATA TTGTGCGGGT GCACGATGTT
GCGGCAATGG TCGATGTTTG TCGAATGACC GACGCTTTAG TTCGTCACGA AGGATAG
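
The GC content reported for this gene could be recomputed from the sequence above. A minimal sketch, assuming GC content means the percentage of G and C bases over all bases and that the spaces between 10-base blocks are only formatting; `gc_content` is a hypothetical helper, not part of the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, as an illustration only.
print(gc_content("ATGCCTGAAT"))  # 40.0
```

Passing the full gene sequence, with or without the block spacing, gives the value to compare against the rounded figure in the record.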
 
Protein sequence
MPEFAYNIRQ YYPQQRQELL QMIAQIETYP NAAERVLTKA NLIMLHCDQV DPHTAMIVKQ 
ELLALDGDAL VSPHVYLGQS TQPTNLLAWA NERSWRALCG KFQAIPLPAL QALAQQIGAL
LLHNQARGSL KLGATQWDWG RKTLVMGIVN VTPDSFSNDG LLEVGTSQIQ QQALEFAAAG
ADILDVGGES TRPGASSVSI AQEIARVVPA IQAIRHVCQL PISIDSYKAE VVAAALEAGA
NVVNDIWGLR QADGSWNTAL AQLVAQANVP IILMHNRVST VEQFAHGTNY AASDYGDIIG
EVCAELRQSI DFALQAGIAN DLIVLDPGIG FGKSPEQNLQ VLRQLRTIAS LGYPLLVGTS
RKSMIGITLN RPVDQRLWGT AATVAYAIQA GADIVRVHDV AAMVDVCRMT DALVRHEG
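
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a plain-Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGCCTGAAT TTGCCTATAA CATCCGTCAA"))  # MPEFAYNIRQ
print(translate("GTTCGTCACGA AGGATAG"))               # VRHEG
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence.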