Gene Haur_3563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3563 
Symbol 
ID5735422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4478386 
End bp4480086 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content53% 
IMG OID641280710 
Productprolyl-tRNA synthetase 
Protein accessionYP_001546327 
Protein GI159900080 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000136034 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATGA GTAGTGGTTT CGGGCGCACC TTGCGCGAGG CTCCAAGCGA AGCAGAATTA 
GCTGCACATC AATTAATTTT ACGGGCTGGC TTAGCACGGC AATTATTAGC TGGTGGCATG
GCGCTGTTGC CACTGGGCAT GCGAGTATTT CGGCGGATTG AAGCAATTAT GCATGCTGAA
TTAGCTGCTA TCGGTGCTGG TGAATTTCGC ACGCCAGTTG TGCATGCTGC CAGTTTATGG
GAGCAAACCG GACGTTATGC CCAATATGGC GAGGCTATGC TACGCTTCAA CAATCGCAAT
CAACAGGCTT TATTATTTGC GCCAACCCAC GAAGAGGCGG TTGCCGAGCT AGCCCGTCGC
GAGGTTGATT CGTATCGCCA ACTGCCAAGC CTGCTCTACC AAATTCATAC CAAATATCGC
GATGAATTGC GGGTTCGTGG TGGTTTGTTG CGGCTACGCG AATTTACCAT GCTTGATGCC
TATTCGCTTG ATACCGATTG GGCGGGCTTG GATATGGTTT ATGATCGGGT TGCGCTCGCT
TTCGAAACGA TCTTTGAGCG GTGTGGCGTG CGTTTTACCG CTGTCGAAGC CGATGGCGGC
GAGATGGGCG GCCGTGAACC ACGCGAATAT ATGGCGTTTT CGAGCAGCGG CGAAGATAGC
TTGGTGGTTT GCCCGCTCTG CAGTTACGCC GCCAATAGCG AGGTTGCGGT GCGCGGCCAA
GCTGCTGCCA ATGATGATGT CGTACCAGCT ATGAGCGAAA TCGCCACTCC AGCCTGCACC
ACGATTGCCG AACTCGCCAC ATTTTTGCAG GTGAGCGAAG CCCAAACTGC CAAAGCAGTC
TTTTTTAACT CAGCCGAAAA GGGCTTGATT TTCGTGGTGG TGCGCGGTGA TCGTGAAGTC
AATGAAATTA AATTACGAGC GGCGGCAGGT GTTTCGGCGC TTGAGCCAGC CACGCTTGAG
CAAATTAGCG CGGTTGGGGC AGTCGCAGGC TATGCATCGC CAGTTGGCCT GAGCAATGTT
ACAGTAATCG CCGATCATTC GGTGGTTGGC GTTGGTGGCT TGGTTGCTGG AGCCAATCGC
ACAGGCTATC ACTTACAAAA CGTCGTGTAT GGCCGTGATT GGCAAGCAAC CGTGGTTGCT
GATATTGCCA ATGTCGAGGA AGGCGATGCT TGCCCTGTCT GTGGCGCAGC TTTGAGCTTG
GAACGGGGCA TCGAAATTGG TCATATCTTT AAATTAGGCA CTCGTTACAC CGAAGCGCTC
GGCGCAACTT ACCTTGACCC ACAAGGCCAA GCTCAGCCAA TCGTCATGGG TTCGTATGGC
ATTGGCCTCG AACGTTTGTT GCAAGTCATT ATCGAGCAGC ATCACGATGA AAAAGGCATT
GTTTGGCCTG CATCGGTCGC ACCATTCGAT CTGCATTTGG TGCAACTTGG TGCTAGTGCC
ACGGTCAGCG AGGCCGCTAA TCAACTTTAT CAACAATTGA GCGAAGCTGG TCTCAGCGTG
CTCTACGACG ATCGCAATGA ATCGGCGGGA GTCAAATTTA ACGATGCTGA TTTATTGGGC
ATGCCGTTGC GGCTTACGGT TGGCGAACGT GGCCTCAAGC AAAATGTTGT CGAGTTACGC
CAACGAGCAA CTGGGGTAGT CGAGACAATC GCGCTTGATC AAGTGGTGAA GAGTATTAAG
AACATAGAGC ATAGAGCATA G
 
Protein sequence
MRMSSGFGRT LREAPSEAEL AAHQLILRAG LARQLLAGGM ALLPLGMRVF RRIEAIMHAE 
LAAIGAGEFR TPVVHAASLW EQTGRYAQYG EAMLRFNNRN QQALLFAPTH EEAVAELARR
EVDSYRQLPS LLYQIHTKYR DELRVRGGLL RLREFTMLDA YSLDTDWAGL DMVYDRVALA
FETIFERCGV RFTAVEADGG EMGGREPREY MAFSSSGEDS LVVCPLCSYA ANSEVAVRGQ
AAANDDVVPA MSEIATPACT TIAELATFLQ VSEAQTAKAV FFNSAEKGLI FVVVRGDREV
NEIKLRAAAG VSALEPATLE QISAVGAVAG YASPVGLSNV TVIADHSVVG VGGLVAGANR
TGYHLQNVVY GRDWQATVVA DIANVEEGDA CPVCGAALSL ERGIEIGHIF KLGTRYTEAL
GATYLDPQGQ AQPIVMGSYG IGLERLLQVI IEQHHDEKGI VWPASVAPFD LHLVQLGASA
TVSEAANQLY QQLSEAGLSV LYDDRNESAG VKFNDADLLG MPLRLTVGER GLKQNVVELR
QRATGVVETI ALDQVVKSIK NIEHRA