Gene Hhal_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2082 
Symbol 
ID4709984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2285396 
End bp2286871 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content70% 
IMG OID639856556 
Productanthranilate synthase component I 
Protein accessionYP_001003648 
Protein GI121998861 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAGC ACGAGTTCCA CAGCCTCGCC GAAGCCGGGT ACACCCGTAT CCCGCTCATC 
CGCGAGGTCC TGGCCGATCT CGACACCCCG CTGTCCACCT ATCTCAAGCT CGCCCGCGGG
CCGCGCTCCT TCCTCTTCGA GTCGGTGGAG GGCGGTGAGA AGTGGGGCCG CTACTCCATC
ATCGGGCTGC CGGCCCGGAC CGAGATCGCC GTCTACGGGC ACCGGGTTGA GGTCCGCCGC
GATGGCGCGC TCGTCGACGA GCAGCAGGTG CGCGATCCGC TGGCCTGGAT CGAGGCGTAT
CAGCACCGTC ATCGCGCCGC GACGCCCTCC GGATTCCCGC AGATGCCGCG GTTCCTGGGG
GGGCTGGTGG GGTACTTCGG CTACGACACG GTGCGCTACG TCGAGCCGCG GCTGGCCGCT
AGCGCCCCGG AAGACCCCCT CGGGCTGCCG GACATCTGCC TGGTGGAGGC CGAGGAGGTG
GTCGTCTTCG ACAACCTCGC CGGGCGGCTC TATCTGGTGG TCCACTGCGA TCCGCGCGAG
CCGGACGCCT ATGCAGCCGG CCTGGCACGG CTGGACGAAC TGGCCGGGCG GCTGGCCGAG
CCCCTGGAGC GCACCCGGCG TCCGGGCGGC GGCGTGGCGG CGGGGGAGCC CCGCTACAAC
TTCACCCAGG CCGGGTTCGA GGCCGCGGTG GAGCGGATCC GCGAGTACAT CGCCGCCGGC
GACGTGATGC AGGTGGTCCC GTCGCAGCGC ATGAGCATGC CGTTCAGCGC GGAGCCCCTC
GATCTCTACC GCGCGTTGCG CTGCACCAAC CCCTCGCCCT ACATGTACTT CCTCGACCTC
GGCGCCTGCC AGGTGGTCGG CTCCTCGCCG GAGATCCTGG TCCGCCTCGA GGATGGCGAG
GTGGCCGTGC GCCCGATTGC CGGGACCCGC AAGCGCGGCG CCACCGAGGC CCGCGACCAG
GAGCTGGAAC AGGAGCTGGT CTCCGACCCG AAGGAGATCG CCGAGCACGT GATGCTCATC
GACCTCGGGC GCAACGACGT CGGCCGGGTC GCCGAGACCG GCACGGTGCG CCTGACCGAG
CGCATGGTGG TGGAGCGCTA CTCCCAGGTC ATGCACATCG TCTCCAACGT GGTCGGGCGG
CTGCGACCGG GGCTCGGTCC CATGGACGTG CTGCGGGCCA CCTTCCCGGC CGGCACGGTC
TCCGGCGCGC CGAAGATCCG TGCCATGGAG ATCATCGACG AGGTCGAGCC GGTCAAGCGC
GGTGTCTACG CCGGCGCCGT CGGCTATCTC TCCTGGTCGG GGAACATGGA CACCGCCATC
GCGATCCGGA CCGCGGTGGT TCACGATGGC CAGGTCCACG TCCAGGCGGG TGCCGGTGTG
GTCGCAGACT CGGTTCCCGA GCTGGAGTGG AAGGAGACCC TGAACAAGGG CCGGGCGCTG
CTGCGCGCCG TCGAGATGGC CGAGGCTGGC TTATGA
 
Protein sequence
MKEHEFHSLA EAGYTRIPLI REVLADLDTP LSTYLKLARG PRSFLFESVE GGEKWGRYSI 
IGLPARTEIA VYGHRVEVRR DGALVDEQQV RDPLAWIEAY QHRHRAATPS GFPQMPRFLG
GLVGYFGYDT VRYVEPRLAA SAPEDPLGLP DICLVEAEEV VVFDNLAGRL YLVVHCDPRE
PDAYAAGLAR LDELAGRLAE PLERTRRPGG GVAAGEPRYN FTQAGFEAAV ERIREYIAAG
DVMQVVPSQR MSMPFSAEPL DLYRALRCTN PSPYMYFLDL GACQVVGSSP EILVRLEDGE
VAVRPIAGTR KRGATEARDQ ELEQELVSDP KEIAEHVMLI DLGRNDVGRV AETGTVRLTE
RMVVERYSQV MHIVSNVVGR LRPGLGPMDV LRATFPAGTV SGAPKIRAME IIDEVEPVKR
GVYAGAVGYL SWSGNMDTAI AIRTAVVHDG QVHVQAGAGV VADSVPELEW KETLNKGRAL
LRAVEMAEAG L