Gene Hhal_2111 details

Gene Information

Locus tag: Hhal_2111
Symbol:
ID: 4710041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2316090
End bp: 2317187
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 70%
IMG OID: 639856585
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001003677
Protein GI: 121998890
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
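
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2316090, 2317187)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```

Both values agree with the Gene Length and Protein Length fields reported for this gene.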


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTGAGG TCGAGGCGCG GGTCGCCCGC TGGGTGCGGC CACAGGTGCA GGCCCTGGAG 
GCCTATCAGG TGGCCGAGCC GGGCAAGGCC ATCAAGCTCG ATGCCATGGA GAGCCCATGG
GCCTGGCCCG GGGCCCTGGA AGAGGCCTGG CTGGAGCGCA TGCGTTCGGT GTCCGTGAAC
CGCTATCCGG ACCCGGCGGC CCGGCGGCTC AAGCCCCTGC TGCGCGAGGG GTTGGGGGTC
CCCGAGGGGG CAGAGCTGTT GCTCGGCAAC GGCTCCGATG AGCTCATCCA GCTCATCGAT
CTGGCCGTGG CTGGCAGTGG GCGCACGGTG ATGGCCCCGG GGCCGAGTTT TGCCATGTAC
CGGATCATCG CCGAGTATAC CGGCGCCGAA TACGTCGAGG TGCCGCTCGA TGCGGAGTTC
GGGCTGGATC TCGCCGCCAC CCGGGAGGCG GTGTCGGCGT ACAACCCGGC GGTCACCTAC
CTGGCGCACC CGAACAACCC CACCGGCAAT GGCCTCGATC TGGACGCCGT GGCGGAGCTG
GTGGCGCAGA GCGACGGCCT GGTGGTAGTC GATGAGGCCT ACGCCCCCTA CGCCGACAGC
AGCTTCCTGC CGCGGGTGCT GGAGTTCCCC AACTGCCTGG TGCTGCGCAC GCTCTCTAAG
GTCGGTCTGG CGGGCCTGCG GGTCGGGGTG CTGATCGGCC ATCCGGCCTG GATCGACCAG
CTGGAGAAGT GTCGCCTGCC CTACAACCTG GGCAGCCTGG CCCAGGCCAG TGCGGCATTC
GCCGTCGAGC ACCAGGAGGC CCTGGATCGC TGTGTGGCCC ACGTGCTCGG CGAACGGGCG
CGGCTGGTCG AGGAGCTGCC GGCGGTCCCC GGTGTCGAGC AGGTCTGGCC GACGCAGACC
AACTTCCTCA CCTTCCGGGT GCCGCAGGGC AGTGCCGATG CCGTGCACCG TGGTCTGCTC
GATCGAGGGG TCCTGATCAA GCGCCTGCAC GGCAGCCATC CGCGGCTGGA GGACTGCCTG
CGGGTGACGG TCGGTCGCCC CGAGGAGAAC AACCGCTTCC TCGAGGCGCT GGCCGAGACC
CTCGCCGTGG CGGCCTGA
 
Protein sequence
MTEVEARVAR WVRPQVQALE AYQVAEPGKA IKLDAMESPW AWPGALEEAW LERMRSVSVN 
RYPDPAARRL KPLLREGLGV PEGAELLLGN GSDELIQLID LAVAGSGRTV MAPGPSFAMY
RIIAEYTGAE YVEVPLDAEF GLDLAATREA VSAYNPAVTY LAHPNNPTGN GLDLDAVAEL
VAQSDGLVVV DEAYAPYADS SFLPRVLEFP NCLVLRTLSK VGLAGLRVGV LIGHPAWIDQ
LEKCRLPYNL GSLAQASAAF AVEHQEALDR CVAHVLGERA RLVEELPAVP GVEQVWPTQT
NFLTFRVPQG SADAVHRGLL DRGVLIKRLH GSHPRLEDCL RVTVGRPEEN NRFLEALAET
LAVAA
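
The protein sequence can be related back to the gene sequence with a short translation sketch. It assumes the amino-acid assignments of translation table 11 (which match the standard code) and that the initiator codon, here GTG, is read as methionine. The helper names `clean_sequence` and `translate_cds` are hypothetical, chosen for illustration only.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (base order T, C, A, G); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    If is_start is true, the first codon is treated as the initiator
    and reported as 'M' (an assumption about how GTG starts are read).
    """
    seq = clean_sequence(dna)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if is_start and residues:
        residues[0] = "M"
    return "".join(residues)


first_row = "GTGACTGAGG TCGAGGCGCG GGTCGCCCGC TGGGTGCGGC CACAGGTGCA GGCCCTGGAG"
last_row = "CTCGCCGTGG CGGCCTGA"
print(translate_cds(first_row))                 # MTEVEARVARWVRPQVQALE
print(translate_cds(last_row, is_start=False))  # LAVAA
```

The first and last rows of the gene sequence translate to the first 20 and last 5 residues of the protein sequence listed above. The last row can be translated on its own because the 18 preceding rows hold 60 bases each, so it begins on a codon boundary.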