Gene Haur_3835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3835 
Symbol 
ID5735700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4814738 
End bp4816138 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content54% 
IMG OID641280988 
Producthistidine--tRNA ligase 
Protein accessionYP_001546599 
Protein GI159900352 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase
[TIGR00443] ATP phosphoribosyltransferase, regulatory subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000618349 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGCTATC AATTAACTCC TGAAGTGGTT CGCGGAACCC GTGACCTTTT TGCCGCTGCC 
GTTCACCAAC GTCAGGCGCT GATTCAGACA TTAACCGCTA CTTTTGATCA AGCTGGCTAT
GACCCGATTG AGTTGCCATT GCTTGAGCAT CGTGAGTTGT ATTTAAAAAA ATCTGGTGAT
GATCTCATCG CCAAATTGTA CCATTGGAAT CAAGGTGGCC GTGATTTGGC CTTACGCCCT
GAATGGACAG CCTCGGTCTT ACGAGCAGTC ATCAGTGGCA TGGGCGATGC GCCAATTCCC
TTACGCTTGC GCTACGCTGG GCCAGTGTTT CGCTATGAAC GCCCACGTCG CGCGACCTAT
CGGCAATTTA CCCAAGTTGG CATCGAATTA ATCGGTGCAC CTGGCCCGTT GGCCGATGCT
GAAGCCCTTG GTTTGGCTGT CACTGGCCTG CGCGAATTGG GCATTCAGCA ATGGACGTTA
ACCATTGGCC ATATTGGTGT AATTAAGACG TTGCTCAATA GCCTCGGCTT ACCCGAACGG
ATCACGTCGG CGCTCACCTG GAGCCTCGAA CGGATTCGTT CTAAAGGGCT TGATGCGGTT
AAACAACAAT GGCGCGACGA CGACGACGAT CTGCCAGTTG ATTTAGCCAG CCTAGCCCAT
CTCGCCGACC AAGATCTTGA GACGCTGTTG TTGCGCGTAT TACCCAGTCT TGGGGTGCGC
CTCGATAGCG GCGGGCGTGA GCCACAAGCA ATCATTCAAC GCTTGGTGCG CAAGCTGCGC
CGTGGCGACG ATGCGCTTAA TCTTGATCGC GCATGGCAGT TGCTTTCAGC TTTAACCGCT
GCCCGTGGCT CGGCACCAGA TGTGATGCAA CAGATGCGCG AGCTATGCCA AGAATATACG
GTTGCACCCG ACGCTTTGGA TGAGTTGCAA ACCACCCTGA CCTTACTTGA GGCATATGGC
GTGCCAGCAG ATCAGATTGT GCTGGATTTT GGCATGGGTC GTGGCTTGCA CTACTACACT
GGCCTGATTT TCGAGATCGA TGGCGCTGAT GGCTTGCAAC TCTGTGGCGG TGGCCGCTAT
GATGATTTAG TTGCGGCGCT TGGTGGGCGA GCAATGCCAG CCGTTGGCTT TGCCTATGGA
CTCGAACGAA TCGTCGCAGC AGTTGCGCCA GCCGAAATTG CCCCAGTTAA AAGTGTGTTG
GTTGTTGGCG ATGATCATGG CTTGGTAATT CAGGCAGCAG CAGCACTCCG CCAACAAGGC
TATCGCACGG CAGTTGATTT GCGCCAGCGT TCGTATGCGG CCAATTTAAA TGATGCCCGT
CGGCGTGAAA TGAGCCATCT GGCCTTGGTT AGTGTTGATG GTATTCAACT GCGCGATTTA
AATGAACAGA AACCCCAATG A
 
Protein sequence
MGYQLTPEVV RGTRDLFAAA VHQRQALIQT LTATFDQAGY DPIELPLLEH RELYLKKSGD 
DLIAKLYHWN QGGRDLALRP EWTASVLRAV ISGMGDAPIP LRLRYAGPVF RYERPRRATY
RQFTQVGIEL IGAPGPLADA EALGLAVTGL RELGIQQWTL TIGHIGVIKT LLNSLGLPER
ITSALTWSLE RIRSKGLDAV KQQWRDDDDD LPVDLASLAH LADQDLETLL LRVLPSLGVR
LDSGGREPQA IIQRLVRKLR RGDDALNLDR AWQLLSALTA ARGSAPDVMQ QMRELCQEYT
VAPDALDELQ TTLTLLEAYG VPADQIVLDF GMGRGLHYYT GLIFEIDGAD GLQLCGGGRY
DDLVAALGGR AMPAVGFAYG LERIVAAVAP AEIAPVKSVL VVGDDHGLVI QAAAALRQQG
YRTAVDLRQR SYAANLNDAR RREMSHLALV SVDGIQLRDL NEQKPQ