Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4444 |
Symbol | |
ID | 5736295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5684698 |
End bp | 5686293 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281607 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_001547204 |
Protein GI | 159900957 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATAT TTTTATATGA TACAACGCTG CGGGACGGCA CTCAACGCGA AGGGCTTTCG TTATCTTTAG CTGATAAATT GAAAATCGCT CGCGAACTTG ATCGCTTTGG CATGCACTAT ATCGAAGGCG GTTGGCCTGG CTCGAACCCC AAGGATGCAG CATTTTTCGC CGAAGCTGCC AAAATGGAAT GGAAGCACGC CAAAATTGCC GCTTTTGGTT CAACCCGCCG TGCCAACAGC AAACCTGAAA CCGATGCCAA TCTCAAGGCT TTGCTCGATG CGAACACACC TGTGGTAACC TTGGTTGGCA AATCGTGGAC ATTGCACGTA ACTGAGGTGT TGGAAACGAC GCTCGAAGAA AATTTGGCCA TGATTCGCGA TAGCGTGGCG CTGATGAAAG CCCATGGCAA AGAAGTGATT TACGATGCTG AACACTTTTT TGATGGCTAC CGCGCTGATA ACGACTATGC CCTAGCCACA ATCAAGGCTG CCGCCGAAGC TGGCGCAGAT TGGATTGTGC TATGCGATAC CAACGGCGGC TCGTTGCCCG ATTGGATTAG CGCCGTGGTG CAGCGGGTCA AGGGTAAAAT CAACACTCAA TTGGGCATTC ACACCCACAA CGATAGCGAG TTGGCGGTAG CAAATTCCTT GGCCGCGATT GTTGGTGGTT GCCGCCAAGT TCAGGGCACG ATTAACGGCT ATGGCGAGCG CTGCGGTAAC GCCAACTTAA TCTCGATCAT TCCCAATTTG CAACTGAAAA TGGGCATGTT CTGTGTGCTG CCTGATCAAT TACAACGTTT GACCGAGCTT TCACGCACCG TCAGCGAAAT TGCCAATTTG AACCCCGACG AGCATGCCGC CTATGTTGGC AACAGCGCGT TTGCCCATAA AGGCGGGATT CATGTCGCAG CTGTGGCAAA AGTCGAGCAT TCATACCAAC ATATCGAGCC AGTTCAAGTG GGCAATCGCA AACGGGTAGT GATCAGCGAG CTTTCAGGCC GTGGTAACAT CAAAATGCGA GCCGAAGAAT TGGGCGTGGA AAGCACAGGT CTCGAACGCG GCGTGCTCGA ACGCGTCAAA CTGCTTGAAA GCAAGGGCTT TCAGTTTGAA GCTGCCGAAG GTTCATTTGA ACTTTTGGTG CGCCGCGCCG CTGCCGATTA TGCAGCGCCC TTCAAATTGC TCGATGTTGT CACGATTGTT GAGCAACGAC GGGGGGTCGA GATGCAGGCC GAGGCGACAG TTAAGCTACA AATTGGCGAG GAAATTTATC ATACAGCAGC TTCGGGCAAT GGCCCAGTTA ACGCACTCGA CCAAGCCATG CGCAAAGCCT TGCTCTCACG CTACCCCGAA TTGGCCGAAG TCCATTTGGT CGATTACAAA GTGCGGATTC TCGATTCAGA ATCGGCGACC GGAGCGACCA CCCGGGTGTT GATTGAAGCA GCCATGGGCG ACGAACGTTG GACAACCGTC GGCTGCTCCG AAAATATTAT CGAAGCCAGT TGGCAAGCCT TGGTTGATTC GCTGGAATTG CCCTTGGTTC GTGCTCGCAG CAACCAGCCA GTGCTACTCA AACACGCCGC AACTGTGGCC GCCTAA
|
Protein sequence | MQIFLYDTTL RDGTQREGLS LSLADKLKIA RELDRFGMHY IEGGWPGSNP KDAAFFAEAA KMEWKHAKIA AFGSTRRANS KPETDANLKA LLDANTPVVT LVGKSWTLHV TEVLETTLEE NLAMIRDSVA LMKAHGKEVI YDAEHFFDGY RADNDYALAT IKAAAEAGAD WIVLCDTNGG SLPDWISAVV QRVKGKINTQ LGIHTHNDSE LAVANSLAAI VGGCRQVQGT INGYGERCGN ANLISIIPNL QLKMGMFCVL PDQLQRLTEL SRTVSEIANL NPDEHAAYVG NSAFAHKGGI HVAAVAKVEH SYQHIEPVQV GNRKRVVISE LSGRGNIKMR AEELGVESTG LERGVLERVK LLESKGFQFE AAEGSFELLV RRAAADYAAP FKLLDVVTIV EQRRGVEMQA EATVKLQIGE EIYHTAASGN GPVNALDQAM RKALLSRYPE LAEVHLVDYK VRILDSESAT GATTRVLIEA AMGDERWTTV GCSENIIEAS WQALVDSLEL PLVRARSNQP VLLKHAATVA A
|
| |