Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1063 |
Symbol | |
ID | 4068712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1329744 |
End bp | 1331507 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983071 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_590140 |
Protein GI | 94968092 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.408216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.634833 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCGCT GGTCTAAGCT CTTCATCCCT ACTCTGCGCG AGGCGCCTGC CGACGCCGAA GTCGCCAGCC ATAAGTTCCT CGTTCGGGCC GGATACATCC GCCAATTAGC CGCAGGCATT TACTCGTACT TGTTCCTCGG GAACCGCTCG ATGAACAAGA TCATCGGTAT CGTCCGCGAG GAGATGGACA AGATCGGCCA GGAATATTAC CTGCCGGCGT TAAATCCCCG TGAGATATGG GAAGCCAGCG GGCGCTGGGC GGTGATGGGC GACAACATGT TCCGCCTCAA GGACCGTAAG GGGGCGGAGC TTTGCCTCGG CATGACCCAC GAAGAGATCA TGACCGAGAT CGCGCGCAAG GAACTGCGCA GCTACAAGCA GTTGCCGCAG ATCTGGTACC AGATCCAGAC CAAGTTTCGC GACGAGCCTC GTCCGAGGTC GGGACTGCTG CGCGTGCGCC AGTTCATCAT GAAGGATTCG TACTCGTTCG ACATCGACGC CGCGGGTCTG GACATCAGCT ACCAGAAGCA CCATGACGCG TACTGCCGCA TCTTCGACCG TTGCGGGTTG AAGTACGTCG TTGTACACGC GGATTCCGGC GCGATGGGCG GCTCTGGGTC TCAGGAATTC ATGGTCTACA CCGACGCCGG TGAAGACCTC GTTGCGAGCT GCGCGAACTG CAGTTACGCC GCGAATGTGG AGAAAGCCAC ATCGAAGCTG GAAGCCATCG AGGACCTCGT TGCTACGGCC GATACGCCCG AACTCGTTCA TACGCCCGGG CAGAAGACGA TTGAGCAAGT CGCCGCGTAC CTTGGCGTCT CACCGAAGAA CAAGATCAAG ACGCTTGCCT ACATGATGGC CGCTCCCAAG GGAGCCAAGG ACGGAAGAGA GCAGGCCCTC GTCGTGCTCC TGCGAGGCGA CCATATGCTC AACGAGGCAA AGCTCGGCGC TGCAATCAAG GGGCGCGAAG TTCGTCCGAT GACGGAGGAA GAAATCCAGG ATCTGTTCCA TTCCCCTGCC GGATACCTTG GACCTCTCAA TGTCGAATGG GCCAAGACCT CCGAAGATAC CGAGAAACCT TTGCTGCTTT TGGATGAGGC ACTGGTCAGT CGCAAGAACC TGATCGCAGG CGCGAACAAA GAGGAGTATC ACGTTCGTAA CCTCACCCCG GGCGAAAGCT TCCAGTTCAC CGGCTCGGCT GACCTGCGGA TGGTCGCGGA AGGCGAGCCT TGCCCGAACT GCGGACACGC CTTGAAGGTG GGCAAGACGG TCGAGATCGG CCACATCTTC AAGCTCGGCT ACAAGTACAC GGACGCTATG GGTGCCCGCG TTCTCGACAA GGATGGCAAG GAAGTCATGC CGATCATGGG CAGCTACGGC ATTGGCATGG AGCGCATCTT GACGGCGTCA GTCGAGCAGT CCAACGACGA TAACGGCTTC TGGCTGCCTG CCCAGATCGC CCCGTTCGAA GTCGTTGTTA CCCCAACCAA CGTCAGCGAC GAAAAGCTGG CGAAAGGGGC TGAGGAGATC GCTGCAAAGC TGGAGGCTGC AGGGTTTGAC GTCATCCTGG ACGACCGCGA CGAGCGGCCG GGTGTGAAGT TCAAGGATGC AGACTTGGTG GGTATCCCCG TCCGGATAAA CGTGGGAAAG AAGTTCGTGG AGGGCAAAGT TGAGGTAATT CACCGCTCGA CACGTGAGTC GCTCGATGCT ACGATTCCGG AAATCGTTGA AAAGATAGCG GCTTGGTTGA AACCGAGTGC TTAA
|
Protein sequence | MHRWSKLFIP TLREAPADAE VASHKFLVRA GYIRQLAAGI YSYLFLGNRS MNKIIGIVRE EMDKIGQEYY LPALNPREIW EASGRWAVMG DNMFRLKDRK GAELCLGMTH EEIMTEIARK ELRSYKQLPQ IWYQIQTKFR DEPRPRSGLL RVRQFIMKDS YSFDIDAAGL DISYQKHHDA YCRIFDRCGL KYVVVHADSG AMGGSGSQEF MVYTDAGEDL VASCANCSYA ANVEKATSKL EAIEDLVATA DTPELVHTPG QKTIEQVAAY LGVSPKNKIK TLAYMMAAPK GAKDGREQAL VVLLRGDHML NEAKLGAAIK GREVRPMTEE EIQDLFHSPA GYLGPLNVEW AKTSEDTEKP LLLLDEALVS RKNLIAGANK EEYHVRNLTP GESFQFTGSA DLRMVAEGEP CPNCGHALKV GKTVEIGHIF KLGYKYTDAM GARVLDKDGK EVMPIMGSYG IGMERILTAS VEQSNDDNGF WLPAQIAPFE VVVTPTNVSD EKLAKGAEEI AAKLEAAGFD VILDDRDERP GVKFKDADLV GIPVRINVGK KFVEGKVEVI HRSTRESLDA TIPEIVEKIA AWLKPSA
|
| |