Gene Information |
Locus tag | Hhal_2082 |
Symbol | |
ID | 4709984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2285396 |
End bp | 2286871 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856556 |
Product | anthranilate synthase component I |
Protein accession | YP_001003648 |
Protein GI | 121998861 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
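
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Hhal_2082.
length = gene_length_bp(2285396, 2286871)
print(length)                     # 1476, matching "Gene Length"
print(protein_length_aa(length))  # 491, matching "Protein Length"
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed in the record.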
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGC ACGAGTTCCA CAGCCTCGCC GAAGCCGGGT ACACCCGTAT CCCGCTCATC CGCGAGGTCC TGGCCGATCT CGACACCCCG CTGTCCACCT ATCTCAAGCT CGCCCGCGGG CCGCGCTCCT TCCTCTTCGA GTCGGTGGAG GGCGGTGAGA AGTGGGGCCG CTACTCCATC ATCGGGCTGC CGGCCCGGAC CGAGATCGCC GTCTACGGGC ACCGGGTTGA GGTCCGCCGC GATGGCGCGC TCGTCGACGA GCAGCAGGTG CGCGATCCGC TGGCCTGGAT CGAGGCGTAT CAGCACCGTC ATCGCGCCGC GACGCCCTCC GGATTCCCGC AGATGCCGCG GTTCCTGGGG GGGCTGGTGG GGTACTTCGG CTACGACACG GTGCGCTACG TCGAGCCGCG GCTGGCCGCT AGCGCCCCGG AAGACCCCCT CGGGCTGCCG GACATCTGCC TGGTGGAGGC CGAGGAGGTG GTCGTCTTCG ACAACCTCGC CGGGCGGCTC TATCTGGTGG TCCACTGCGA TCCGCGCGAG CCGGACGCCT ATGCAGCCGG CCTGGCACGG CTGGACGAAC TGGCCGGGCG GCTGGCCGAG CCCCTGGAGC GCACCCGGCG TCCGGGCGGC GGCGTGGCGG CGGGGGAGCC CCGCTACAAC TTCACCCAGG CCGGGTTCGA GGCCGCGGTG GAGCGGATCC GCGAGTACAT CGCCGCCGGC GACGTGATGC AGGTGGTCCC GTCGCAGCGC ATGAGCATGC CGTTCAGCGC GGAGCCCCTC GATCTCTACC GCGCGTTGCG CTGCACCAAC CCCTCGCCCT ACATGTACTT CCTCGACCTC GGCGCCTGCC AGGTGGTCGG CTCCTCGCCG GAGATCCTGG TCCGCCTCGA GGATGGCGAG GTGGCCGTGC GCCCGATTGC CGGGACCCGC AAGCGCGGCG CCACCGAGGC CCGCGACCAG GAGCTGGAAC AGGAGCTGGT CTCCGACCCG AAGGAGATCG CCGAGCACGT GATGCTCATC GACCTCGGGC GCAACGACGT CGGCCGGGTC GCCGAGACCG GCACGGTGCG CCTGACCGAG CGCATGGTGG TGGAGCGCTA CTCCCAGGTC ATGCACATCG TCTCCAACGT GGTCGGGCGG CTGCGACCGG GGCTCGGTCC CATGGACGTG CTGCGGGCCA CCTTCCCGGC CGGCACGGTC TCCGGCGCGC CGAAGATCCG TGCCATGGAG ATCATCGACG AGGTCGAGCC GGTCAAGCGC GGTGTCTACG CCGGCGCCGT CGGCTATCTC TCCTGGTCGG GGAACATGGA CACCGCCATC GCGATCCGGA CCGCGGTGGT TCACGATGGC CAGGTCCACG TCCAGGCGGG TGCCGGTGTG GTCGCAGACT CGGTTCCCGA GCTGGAGTGG AAGGAGACCC TGAACAAGGG CCGGGCGCTG CTGCGCGCCG TCGAGATGGC CGAGGCTGGC TTATGA
Protein sequence | MKEHEFHSLA EAGYTRIPLI REVLADLDTP LSTYLKLARG PRSFLFESVE GGEKWGRYSI IGLPARTEIA VYGHRVEVRR DGALVDEQQV RDPLAWIEAY QHRHRAATPS GFPQMPRFLG GLVGYFGYDT VRYVEPRLAA SAPEDPLGLP DICLVEAEEV VVFDNLAGRL YLVVHCDPRE PDAYAAGLAR LDELAGRLAE PLERTRRPGG GVAAGEPRYN FTQAGFEAAV ERIREYIAAG DVMQVVPSQR MSMPFSAEPL DLYRALRCTN PSPYMYFLDL GACQVVGSSP EILVRLEDGE VAVRPIAGTR KRGATEARDQ ELEQELVSDP KEIAEHVMLI DLGRNDVGRV AETGTVRLTE RMVVERYSQV MHIVSNVVGR LRPGLGPMDV LRATFPAGTV SGAPKIRAME IIDEVEPVKR GVYAGAVGYL SWSGNMDTAI AIRTAVVHDG QVHVQAGAGV VADSVPELEW KETLNKGRAL LRAVEMAEAG L
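
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following sketch shows one way to do that and to compute a GC fraction. It is a minimal illustration with hypothetical helper names, and it is demonstrated only on the first two blocks of the gene sequence, not on the full record. To compare against the listed GC content, the complete Gene sequence value would need to be passed in instead.

```python
def normalize_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-normalized sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence in the record (a fragment, for illustration).
first_blocks = "ATGAAAGAGC ACGAGTTCCA"
seq = normalize_sequence(first_blocks)

print(seq)                    # ATGAAAGAGCACGAGTTCCA
print(seq.startswith("ATG"))  # True: the CDS begins with an ATG start codon
print(gc_fraction(seq))       # 0.45 for this 20-base fragment only
```

The fragment's GC fraction says nothing about the gene as a whole. It only shows that the helpers behave as intended on the block-formatted input.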