Gene Haur_2246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2246 
Symbol 
ID5734133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2862943 
End bp2864247 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content53% 
IMG OID641279387 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001545014 
Protein GI159898767 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000301109 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATTA CTCCTCGCGC CTACAAAGGC ATGCGCGATC ACTTGCCCGA AGCTATGCGC 
TTGCGGCGTT TTATCACCGA TACCTTGATT GGTATTTTAG AGCGCTATGG CTTTGAGCCA
CTTTCCACAC CGATTGTCGA ATATTCGGAA ACGCTCGAAG GCAAGATTGG CGATGAAGAA
AAATTGCTGT ATCGTTTGAA ATATGGCGAT GATGCTTTGA CCTTGCGTTA CGACCAAACT
GTGCCCTTGG CGCGGGTGGT GGCCCAAAAC GAAGGCAAAT TAACTATGCC CTTCAAACGC
TATGCGCTCG GCCAATCGTA TCGCGGCGAA CGCCAAGCTC GTGGTCGCTA CCGCGAATTT
TGGCAGCTTG ATGCCGATAT TGTTGGGGTA GATAGCCCGA TTGCCGATGC TGAAATTGTG
GCGGTGGTCG TTGAAGGCTT GCGAGCTTTA GGCTTTACTG GTGCAAAAGT CTTGCTCAAT
CACCGCGAAA TCTTGAGTGG GTTGGCGCGG GTAGCAGGTG TGCCCGAAAA CGAGGCAGGT
GGAGTATATC GCGCGATCGA TAAGCTCGAT AAAATTGGCA ATGACGGTGT GCGCAACGAA
TTGCTCAAAA GTGGCGTGAG TGCCGAAGCT GCTGAGCGTG TGTTGCACTT CGTCGGGATT
AGCGGCTCGA TCGAAGCGGT GTTGGCCGAA ATGGAAAGCG TGCTAGCCAA CGATCCGCCA
GCTTTGGCAG CGGTTGCGGC CTTGCGCACC ATCTGCGAGG TATTGACGAG CTTTGGCGTG
CCAGCCGATA GTTTTACGAT TGCGCCTAGC TTGGCTCGCG GCTTATCCTA CTATACGGGC
TGTGTGTTCG AGGCCGTGCT CGATTCGCCA CCGATGGGTT CGTTATTGGG CGGTGGCCGC
TACGACAATT TGGTGGGTAT GTTTAGCAAA CGCTCGTTGC CGACGGTTGG CTGTGCTTTT
GGGCTTGAAC GCTTGTTTGA TTTGATGCTT GAGCTTAATA TGGGGCCACG CCCAGAGCGC
ACGATCGATG CCTATGTTAC CTTATTTGCT GGCGATTTTC AAAATGAGAG CCTGCGCTTG
GCGGGCGAAT TGCGGGTAGC TGGCCTGAGT GTATTGACCG CGTATAGTCC AGTCAAAATT
GCCAACCAGT TCAAAGAGGC AGATCGCAAG GGTGCGAATT TTGCCTTGGT GCTTGGCCCC
GATGATTTGG CTGCAGGCGT AGTACAGCTC AAAGATTTAC GCTCGGGTCA GCAACAAGCT
GTAGCGCGTG ATGCGATCGT AGCGGCAATC AAAGCGGCCC AATAA
 
Protein sequence
MPITPRAYKG MRDHLPEAMR LRRFITDTLI GILERYGFEP LSTPIVEYSE TLEGKIGDEE 
KLLYRLKYGD DALTLRYDQT VPLARVVAQN EGKLTMPFKR YALGQSYRGE RQARGRYREF
WQLDADIVGV DSPIADAEIV AVVVEGLRAL GFTGAKVLLN HREILSGLAR VAGVPENEAG
GVYRAIDKLD KIGNDGVRNE LLKSGVSAEA AERVLHFVGI SGSIEAVLAE MESVLANDPP
ALAAVAALRT ICEVLTSFGV PADSFTIAPS LARGLSYYTG CVFEAVLDSP PMGSLLGGGR
YDNLVGMFSK RSLPTVGCAF GLERLFDLML ELNMGPRPER TIDAYVTLFA GDFQNESLRL
AGELRVAGLS VLTAYSPVKI ANQFKEADRK GANFALVLGP DDLAAGVVQL KDLRSGQQQA
VARDAIVAAI KAAQ