Gene Haur_1052 details

Gene Information

Locus tag: Haur_1052
Symbol:
ID: 5732956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1201129
End bp: 1202280
Gene length: 1152 bp
Protein length: 383 aa
Translation table: 11
GC content: 51%
IMG OID: 641278187
Product: aminotransferase class I and II
Protein accession: YP_001543828
Protein GI: 159897581
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
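
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1201129
END_BP = 1202280
GENE_LENGTH_BP = 1152
PROTEIN_LENGTH_AA = 383


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```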


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCAGC CAATTCATGG GGCAATTGAT TATGCTGAAT TGCAGCAACG TGGGCTTGTA 
GCCAGCCAAA TTGATGATTT CAGCAGCAAC GTCAATCCGC TTGGCACGCC GAGTTTTATT
CGCGAAGCGC TGGCAACTGT TGATTTGGCG CATTACCCCG ATCGTCAATC GTTGGCCTTG
CGGGCGGCGC TGGCCAAACG CCATTGTTGT GAACTTGAAC AGCTATTGAT TGGTAATGGT
AGCAATGAGT TAATTCATCT GATTGCGCGG GCCTTGTTGC AACCAAACGA TCCAGTTTTG
TTGATTGAAC CAACCTTTGG CGAATATGCC TATGCTAGCA GCTTGGCTGG CGCTCAATTG
TTGCGCTATC AAGCAACCAG CGAAACTGGA TTTGCAATTG ATATTGTAGC TTGTTGTCAT
TTGATCAAGC AACATCGCCC GCGCTTGGTT TGGCTGTGCA ATCCCAATAA TCCCACTGGC
AGCTATTTGG ATGCTGAAGC GATTGCCCAA CTTCAAGCAG CGTGTACCAC AGTTCAAGCC
TATTTGGTGC TCGATTTGGC GTATGCTGAT TTGGTTGTTG GGGATTGGGG ATTGGGGATT
GGGGATTGGG GATTGGGTGA ATCGAATTCC TCTCGCCGAC AAGACGGGCG AGAGTCTGGG
AACGGGGGAA GGTTAACCAG CCCCCAGCTC CCAGCCCCCG ACAACCATCA TCAGATTATT
TATCTCTACT CGTTGACCAA AAGCTATGCC TTGGCGGGGT TGCGTTTGGG CTATGTGGTG
GCTGAGCAAG CGGTTATCGC TCGCTTGCAG CGTTGGCAGC CGCAATGGAG CGTCAATAGT
TTGGCTCAGG CCGCAGGTCT AGCGATTTGC CAACATCCAC ATTGGCTAGC CCAACAGCTT
GAGCAATGGT GGATTTGGAG CGAACAATTA CGCCAGGGTT TGAGCCAACT TAGCTTGAAG
GTCTTGCCAA GCTGCTTGCC ATTTTTCTTA GTTGAAGTGG CGAACGCCCA GCAAACCCGT
AGTGCGCTGC TTAACCACGC TTGTTTGGTG CGCGATTGTA GCTCATTTGG TTTGCCGCAG
TTTGTACGGA TCGCCCCGCG CCAACCAGCG GCAAATCAAC GCTTGTTGAA TGCTTGGAGA
AGTTTATGCT AG
 
Protein sequence
MQQPIHGAID YAELQQRGLV ASQIDDFSSN VNPLGTPSFI REALATVDLA HYPDRQSLAL 
RAALAKRHCC ELEQLLIGNG SNELIHLIAR ALLQPNDPVL LIEPTFGEYA YASSLAGAQL
LRYQATSETG FAIDIVACCH LIKQHRPRLV WLCNPNNPTG SYLDAEAIAQ LQAACTTVQA
YLVLDLAYAD LVVGDWGLGI GDWGLGESNS SRRQDGRESG NGGRLTSPQL PAPDNHHQII
YLYSLTKSYA LAGLRLGYVV AEQAVIARLQ RWQPQWSVNS LAQAAGLAIC QHPHWLAQQL
EQWWIWSEQL RQGLSQLSLK VLPSCLPFFL VEVANAQQTR SALLNHACLV RDCSSFGLPQ
FVRIAPRQPA ANQRLLNAWR SLC
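
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned up, checked for GC content, and translated is given below. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the gene sequence listed above.
prefix = clean_sequence("ATGCAGCAGC CAATTCATGG G")
print(translate(prefix))  # MQQPIHG
```

The translated prefix agrees with the first seven residues of the protein sequence listed above. Applying the same functions to the full listing is left to the reader.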