Gene Hhal_2080 details


Gene Information

Locus tag: Hhal_2080
Symbol:
ID: 4709990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2283726
End bp: 2284754
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 70%
IMG OID: 639856554
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001003646
Protein GI: 121998859
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
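
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are inclusive, 1-based positions and that the CDS ends with a single stop codon not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2283726
end_bp = 2284754
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
expected_cds_bp = (protein_length_aa + 1) * 3

print("span matches gene length:", span_bp == gene_length_bp)
print("protein length matches gene length:", expected_cds_bp == gene_length_bp)
```

Under these assumptions both checks agree with the reported gene length of 1029 bp.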


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCTAA CCGCCGCCCT TCGTCGCATC ACGGAGAACC AGGATCTCAG CCCCGATGAG 
ATGACCGCGG TCTTCCGCAC CATCATGACC GGCGGGGCGA CGCCGGCGCA GATCGGTGGC
TTCCTCATCG GCATGCGGCT CAAGGGGGAG ACGGTCCAGG AGATGGCCGC CGCCGCCTCG
GTCATGCGGG AGCTCGCCGA GCGGGTCGAT GTCGGCGACG ACTTCCACCG CCTGGTGGAC
ACCTGCGGCA CCGGTGGCGA TGCCCGCGGC ACCCTGAACG TCTCGACCGC CGCCGCCTTC
GTGACCGCCG CCGGGGGCAT CCCGGTGGCC AAGCACGGCA ACCGCTCGGT CTCCGGGCGC
AGCGGCAGCG CCGACCTGCT CGAGGCCTGC GGCGCCACGC TGGAACTCAG CAGCGAGGCG
GTGGCTGAGT GCATCCGTCG GGTCAACGTT GGTTTCCTCT TCGCCCCGCT GCACCACAGC
GCCATGAAGC ACGCCGTGGG ACCGCGCAAG GAGCTCGGGG TCCGCACCCT GTTCAACCTG
GTGGGCCCGT TGTCCAACCC CGCCGGGGCG CGGCGCCAGC TGCTCGGGGT CTTCGGGCAG
GAGTGGGTGC GCCCGGTGGC CGAGGTGCTC CAGGCGCTGG GCAGCGACCA CGTGCTGGTG
GTCCACGCCG AGGACGGGCT CGACGAGATC AGCATCGCCG CACCGACGCG GATCGCCGAG
CTGCGCAACG GCCAGATCGA GGAGTACACC GTCACGCCGG AGGATCTGGG GCTGCGCAGC
GCGCCGCTCA ATGAGGTGAC CATCTCCGGC ACCAAGGACA GTCTGGCGAT GATCCGTGCC
GCCTTCTCCG GCGAGCGCAT TGCCGCCATG GAGCTGATCG CCGCCAACGC CGGCGCTGCG
CTCTATGTTG GCGGCGAGGC CCCCGATCTG CGTCGTGGTG TGGAGCGAGC CCGGGAACTC
ATGACCTCCG GTGCCGCCGC TCAGACGCTG GAGCGCTTCG TGGCGACGAC CAAGGAACTC
GCCCAATGA
 
Protein sequence
MDLTAALRRI TENQDLSPDE MTAVFRTIMT GGATPAQIGG FLIGMRLKGE TVQEMAAAAS 
VMRELAERVD VGDDFHRLVD TCGTGGDARG TLNVSTAAAF VTAAGGIPVA KHGNRSVSGR
SGSADLLEAC GATLELSSEA VAECIRRVNV GFLFAPLHHS AMKHAVGPRK ELGVRTLFNL
VGPLSNPAGA RRQLLGVFGQ EWVRPVAEVL QALGSDHVLV VHAEDGLDEI SIAAPTRIAE
LRNGQIEEYT VTPEDLGLRS APLNEVTISG TKDSLAMIRA AFSGERIAAM ELIAANAGAA
LYVGGEAPDL RRGVERAREL MTSGAAAQTL ERFVATTKEL AQ
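
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It assumes the codon assignments of translation table 11, which are the same as the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The function names `clean`, `gc_content` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGGACCTAA CCGCCGCCCT TCGTCGCATC ACGGAGAACC AGGATCTCAG CCCCGATGAG"
)
print(translate(first_line))  # MDLTAALRRITENQDLSPDE
print(round(gc_content(first_line), 2))
```

The translation of the first line matches the first 20 residues of the protein sequence. Running the same functions on the full gene sequence would let the reported protein length and GC content be compared against the record.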