Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4450 |
Symbol | |
ID | 5736301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5692652 |
End bp | 5694337 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641281613 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001547210 |
Protein GI | 159900963 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0129872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTCAG ATACGATCAA GCGGGGCTTT GCACGCGCTC CACATCGTTC ACTCCTACGT GCGACGGGCC AAATTCAAGA TGATAGCGAC TTTCAAAAGC CGTTCGTCGC GATCTGTAAT TCGTATATCG ATATTATTCC TGGCCACGTT CACTTGCATG AGTTTGCCAA AATCGTCAAA GATGCAGTAC GGGCAGCAGG CGGCATTCCC TTTGAATTCA ACACGATTGG GGTTGATGAT GGCATCGTGA TGGGCCACGA AGGTATGCGC TATTCCTTGC CATCGCGCGA GTTGATCGCC GATTGTGTTG AAACGGTGGC AGCAGCCCAC TGTTTCGATG CGATGATCTG TATCCCCAAC TGCGATAAAA TCGTGCCTGG CATGCTGATG GGCGCTGCCC GCGTCAACAT CCCCACCGTT TTCGTATCGG GCGGGCCAAT GCAGGCTGGC CGCGATAAAG ATGGCAATAA AGTTGACTTG ATCAGCGTGT TCGAGGGCGT TGGTCAACAT GCTTCTGGCC GCATCAGTGA TGAACGGCTG CTGGATCTGG AGCGCAACGG CTGCCCGACC TGCGGCTCCT GCTCCGGCAT GTTCACCGCC AATTCGATGA ATTGTCTGTG TGAAGCACTG GGAATCGCCC TGCCATACAA CGGTTCATTA CTGGCCACCG ACCCTGCCCG CCACGAGCTA GCCCGCACCG CCGCCACCAA AGTCCTCGAA TTGCTCCGCC AGAATATTTC CTTCAGCGAT ATTGTCACTC CCGAATCAAT TGATAACGCG ATGGCGCTCG ACGTAGCCAT GGGCGGCTCG ACCAACACCA TCTTGCACGT TTTGGCCTTG GCGCGTGAAG CAGGCTTGGA TTACCCAATC AGCCGCTTTA ACGAGGTTGC AGCTCGCGTA CCACACTTGG CCAAAGTTAG CCCAGCCTGG GATGGCACTC GCCAATGGCA TATGGAAGAT GTGCATCGCG CTGGTGGCGT ACCAGCAATT ATGGCCGAAT TGGCCAAAAA GCCCGATGCC CTACACTTGA ATGTGCCAAC CGTGACAGGC CAAACCTTGG GTGAACAATT GCAAGGCATC GCCAACGAAA ACCCTGAATG CATCCGCCCA ATCGAACATC CCCATTCAGC CCAAGGCGGT TTGTGTATTT TGTTTGGCAA CTTGGCTCCT GAAGGCTCGG TGATCAAAAT TGGTGCAGTT GATCAACATC AAATGACATT TAGCGGCCCG GCCCGTTGTT TCCCCAGCGA AGAAGCCGCC ACCTACGCCG CCCGCGCTGG CGAAATTCAA GCTGGCGATG TGGTGGTAGT GCGCTACGAA GGCCCACGCG GTGGCCCAGG TATGCGCGAA ATGCTGGCCT TAACCTCGTT GCTCAAAGGC ATGCCCTTGG GCGAGCAAGT GGCCTTATTG ACCGATGGTC GCTTCAGCGG CGGCACACGC GGCCTGTGTA TTGGCCATAT CTCCCCCGAA GCCGCCGAGG GTGGCCCAAT TGGCCTGATC GAAAATGGCG ACATCATTCA CATCGATTTA GCAAATCGCC TGTTGGCAGT TGACCTCAGC GAAACGCAAT TTGCCGAACG CCGCGCTGCG TGGCAAGCCC CAGAACGCAA ACATCAACGC GGCTGGTTGG CACGCTACAC CCGTTTGGTG ACGAACGCCA GCAATGGCGC GGTGTTGGAA GCATAG
|
Protein sequence | MRSDTIKRGF ARAPHRSLLR ATGQIQDDSD FQKPFVAICN SYIDIIPGHV HLHEFAKIVK DAVRAAGGIP FEFNTIGVDD GIVMGHEGMR YSLPSRELIA DCVETVAAAH CFDAMICIPN CDKIVPGMLM GAARVNIPTV FVSGGPMQAG RDKDGNKVDL ISVFEGVGQH ASGRISDERL LDLERNGCPT CGSCSGMFTA NSMNCLCEAL GIALPYNGSL LATDPARHEL ARTAATKVLE LLRQNISFSD IVTPESIDNA MALDVAMGGS TNTILHVLAL AREAGLDYPI SRFNEVAARV PHLAKVSPAW DGTRQWHMED VHRAGGVPAI MAELAKKPDA LHLNVPTVTG QTLGEQLQGI ANENPECIRP IEHPHSAQGG LCILFGNLAP EGSVIKIGAV DQHQMTFSGP ARCFPSEEAA TYAARAGEIQ AGDVVVVRYE GPRGGPGMRE MLALTSLLKG MPLGEQVALL TDGRFSGGTR GLCIGHISPE AAEGGPIGLI ENGDIIHIDL ANRLLAVDLS ETQFAERRAA WQAPERKHQR GWLARYTRLV TNASNGAVLE A
|
| |