Gene Haur_4450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4450 
Symbol 
ID5736301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5692652 
End bp5694337 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content56% 
IMG OID641281613 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001547210 
Protein GI159900963 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0129872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTCAG ATACGATCAA GCGGGGCTTT GCACGCGCTC CACATCGTTC ACTCCTACGT 
GCGACGGGCC AAATTCAAGA TGATAGCGAC TTTCAAAAGC CGTTCGTCGC GATCTGTAAT
TCGTATATCG ATATTATTCC TGGCCACGTT CACTTGCATG AGTTTGCCAA AATCGTCAAA
GATGCAGTAC GGGCAGCAGG CGGCATTCCC TTTGAATTCA ACACGATTGG GGTTGATGAT
GGCATCGTGA TGGGCCACGA AGGTATGCGC TATTCCTTGC CATCGCGCGA GTTGATCGCC
GATTGTGTTG AAACGGTGGC AGCAGCCCAC TGTTTCGATG CGATGATCTG TATCCCCAAC
TGCGATAAAA TCGTGCCTGG CATGCTGATG GGCGCTGCCC GCGTCAACAT CCCCACCGTT
TTCGTATCGG GCGGGCCAAT GCAGGCTGGC CGCGATAAAG ATGGCAATAA AGTTGACTTG
ATCAGCGTGT TCGAGGGCGT TGGTCAACAT GCTTCTGGCC GCATCAGTGA TGAACGGCTG
CTGGATCTGG AGCGCAACGG CTGCCCGACC TGCGGCTCCT GCTCCGGCAT GTTCACCGCC
AATTCGATGA ATTGTCTGTG TGAAGCACTG GGAATCGCCC TGCCATACAA CGGTTCATTA
CTGGCCACCG ACCCTGCCCG CCACGAGCTA GCCCGCACCG CCGCCACCAA AGTCCTCGAA
TTGCTCCGCC AGAATATTTC CTTCAGCGAT ATTGTCACTC CCGAATCAAT TGATAACGCG
ATGGCGCTCG ACGTAGCCAT GGGCGGCTCG ACCAACACCA TCTTGCACGT TTTGGCCTTG
GCGCGTGAAG CAGGCTTGGA TTACCCAATC AGCCGCTTTA ACGAGGTTGC AGCTCGCGTA
CCACACTTGG CCAAAGTTAG CCCAGCCTGG GATGGCACTC GCCAATGGCA TATGGAAGAT
GTGCATCGCG CTGGTGGCGT ACCAGCAATT ATGGCCGAAT TGGCCAAAAA GCCCGATGCC
CTACACTTGA ATGTGCCAAC CGTGACAGGC CAAACCTTGG GTGAACAATT GCAAGGCATC
GCCAACGAAA ACCCTGAATG CATCCGCCCA ATCGAACATC CCCATTCAGC CCAAGGCGGT
TTGTGTATTT TGTTTGGCAA CTTGGCTCCT GAAGGCTCGG TGATCAAAAT TGGTGCAGTT
GATCAACATC AAATGACATT TAGCGGCCCG GCCCGTTGTT TCCCCAGCGA AGAAGCCGCC
ACCTACGCCG CCCGCGCTGG CGAAATTCAA GCTGGCGATG TGGTGGTAGT GCGCTACGAA
GGCCCACGCG GTGGCCCAGG TATGCGCGAA ATGCTGGCCT TAACCTCGTT GCTCAAAGGC
ATGCCCTTGG GCGAGCAAGT GGCCTTATTG ACCGATGGTC GCTTCAGCGG CGGCACACGC
GGCCTGTGTA TTGGCCATAT CTCCCCCGAA GCCGCCGAGG GTGGCCCAAT TGGCCTGATC
GAAAATGGCG ACATCATTCA CATCGATTTA GCAAATCGCC TGTTGGCAGT TGACCTCAGC
GAAACGCAAT TTGCCGAACG CCGCGCTGCG TGGCAAGCCC CAGAACGCAA ACATCAACGC
GGCTGGTTGG CACGCTACAC CCGTTTGGTG ACGAACGCCA GCAATGGCGC GGTGTTGGAA
GCATAG
 
Protein sequence
MRSDTIKRGF ARAPHRSLLR ATGQIQDDSD FQKPFVAICN SYIDIIPGHV HLHEFAKIVK 
DAVRAAGGIP FEFNTIGVDD GIVMGHEGMR YSLPSRELIA DCVETVAAAH CFDAMICIPN
CDKIVPGMLM GAARVNIPTV FVSGGPMQAG RDKDGNKVDL ISVFEGVGQH ASGRISDERL
LDLERNGCPT CGSCSGMFTA NSMNCLCEAL GIALPYNGSL LATDPARHEL ARTAATKVLE
LLRQNISFSD IVTPESIDNA MALDVAMGGS TNTILHVLAL AREAGLDYPI SRFNEVAARV
PHLAKVSPAW DGTRQWHMED VHRAGGVPAI MAELAKKPDA LHLNVPTVTG QTLGEQLQGI
ANENPECIRP IEHPHSAQGG LCILFGNLAP EGSVIKIGAV DQHQMTFSGP ARCFPSEEAA
TYAARAGEIQ AGDVVVVRYE GPRGGPGMRE MLALTSLLKG MPLGEQVALL TDGRFSGGTR
GLCIGHISPE AAEGGPIGLI ENGDIIHIDL ANRLLAVDLS ETQFAERRAA WQAPERKHQR
GWLARYTRLV TNASNGAVLE A