Gene Information |
Locus tag | Haur_3563 |
Symbol | |
ID | 5735422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4478386 |
End bp | 4480086 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280710 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_001546327 |
Protein GI | 159900080 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
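The coordinate and length fields above are mutually consistent. A minimal Python sketch of the arithmetic follows. It assumes the start and end positions are 1-based and inclusive, and that the 1701 bp gene length includes the stop codon; the record does not state either convention. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Haur_3563 record above.
length = gene_length(4478386, 4480086)
print(length, protein_length(length))  # prints "1701 566"
```

Both results match the Gene Length and Protein Length fields reported for Haur_3563.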
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000136034 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATGA GTAGTGGTTT CGGGCGCACC TTGCGCGAGG CTCCAAGCGA AGCAGAATTA GCTGCACATC AATTAATTTT ACGGGCTGGC TTAGCACGGC AATTATTAGC TGGTGGCATG GCGCTGTTGC CACTGGGCAT GCGAGTATTT CGGCGGATTG AAGCAATTAT GCATGCTGAA TTAGCTGCTA TCGGTGCTGG TGAATTTCGC ACGCCAGTTG TGCATGCTGC CAGTTTATGG GAGCAAACCG GACGTTATGC CCAATATGGC GAGGCTATGC TACGCTTCAA CAATCGCAAT CAACAGGCTT TATTATTTGC GCCAACCCAC GAAGAGGCGG TTGCCGAGCT AGCCCGTCGC GAGGTTGATT CGTATCGCCA ACTGCCAAGC CTGCTCTACC AAATTCATAC CAAATATCGC GATGAATTGC GGGTTCGTGG TGGTTTGTTG CGGCTACGCG AATTTACCAT GCTTGATGCC TATTCGCTTG ATACCGATTG GGCGGGCTTG GATATGGTTT ATGATCGGGT TGCGCTCGCT TTCGAAACGA TCTTTGAGCG GTGTGGCGTG CGTTTTACCG CTGTCGAAGC CGATGGCGGC GAGATGGGCG GCCGTGAACC ACGCGAATAT ATGGCGTTTT CGAGCAGCGG CGAAGATAGC TTGGTGGTTT GCCCGCTCTG CAGTTACGCC GCCAATAGCG AGGTTGCGGT GCGCGGCCAA GCTGCTGCCA ATGATGATGT CGTACCAGCT ATGAGCGAAA TCGCCACTCC AGCCTGCACC ACGATTGCCG AACTCGCCAC ATTTTTGCAG GTGAGCGAAG CCCAAACTGC CAAAGCAGTC TTTTTTAACT CAGCCGAAAA GGGCTTGATT TTCGTGGTGG TGCGCGGTGA TCGTGAAGTC AATGAAATTA AATTACGAGC GGCGGCAGGT GTTTCGGCGC TTGAGCCAGC CACGCTTGAG CAAATTAGCG CGGTTGGGGC AGTCGCAGGC TATGCATCGC CAGTTGGCCT GAGCAATGTT ACAGTAATCG CCGATCATTC GGTGGTTGGC GTTGGTGGCT TGGTTGCTGG AGCCAATCGC ACAGGCTATC ACTTACAAAA CGTCGTGTAT GGCCGTGATT GGCAAGCAAC CGTGGTTGCT GATATTGCCA ATGTCGAGGA AGGCGATGCT TGCCCTGTCT GTGGCGCAGC TTTGAGCTTG GAACGGGGCA TCGAAATTGG TCATATCTTT AAATTAGGCA CTCGTTACAC CGAAGCGCTC GGCGCAACTT ACCTTGACCC ACAAGGCCAA GCTCAGCCAA TCGTCATGGG TTCGTATGGC ATTGGCCTCG AACGTTTGTT GCAAGTCATT ATCGAGCAGC ATCACGATGA AAAAGGCATT GTTTGGCCTG CATCGGTCGC ACCATTCGAT CTGCATTTGG TGCAACTTGG TGCTAGTGCC ACGGTCAGCG AGGCCGCTAA TCAACTTTAT CAACAATTGA GCGAAGCTGG TCTCAGCGTG CTCTACGACG ATCGCAATGA ATCGGCGGGA GTCAAATTTA ACGATGCTGA TTTATTGGGC ATGCCGTTGC GGCTTACGGT TGGCGAACGT GGCCTCAAGC AAAATGTTGT CGAGTTACGC CAACGAGCAA CTGGGGTAGT CGAGACAATC GCGCTTGATC AAGTGGTGAA GAGTATTAAG AACATAGAGC ATAGAGCATA G
|
Protein sequence | MRMSSGFGRT LREAPSEAEL AAHQLILRAG LARQLLAGGM ALLPLGMRVF RRIEAIMHAE LAAIGAGEFR TPVVHAASLW EQTGRYAQYG EAMLRFNNRN QQALLFAPTH EEAVAELARR EVDSYRQLPS LLYQIHTKYR DELRVRGGLL RLREFTMLDA YSLDTDWAGL DMVYDRVALA FETIFERCGV RFTAVEADGG EMGGREPREY MAFSSSGEDS LVVCPLCSYA ANSEVAVRGQ AAANDDVVPA MSEIATPACT TIAELATFLQ VSEAQTAKAV FFNSAEKGLI FVVVRGDREV NEIKLRAAAG VSALEPATLE QISAVGAVAG YASPVGLSNV TVIADHSVVG VGGLVAGANR TGYHLQNVVY GRDWQATVVA DIANVEEGDA CPVCGAALSL ERGIEIGHIF KLGTRYTEAL GATYLDPQGQ AQPIVMGSYG IGLERLLQVI IEQHHDEKGI VWPASVAPFD LHLVQLGASA TVSEAANQLY QQLSEAGLSV LYDDRNESAG VKFNDADLLG MPLRLTVGER GLKQNVVELR QRATGVVETI ALDQVVKSIK NIEHRA
|
| |
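The gene and protein sequences above can be spot-checked against each other. The sketch below is a minimal Python translation routine, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The `translate` helper is a hypothetical name, and the two short fragments are the first 18 and last 21 bases of the gene sequence with the spacing removed.

```python
from itertools import product

# Codon table in TCAG order. Amino-acid assignments are identical
# for translation table 1 and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 18 and last 21 bases of the Haur_3563 gene sequence.
print(translate("ATGCGGATGA GTAGTGGT"))      # prints "MRMSSG"
print(translate("AACATAGAGC ATAGAGCATA G"))  # prints "NIEHRA*"
```

The first fragment reproduces the opening residues of the protein sequence (MRMSSG) and the last fragment reproduces its final residues (NIEHRA) followed by the TAG stop codon.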