Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_3837 |
Symbol | |
ID | 5744789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4700500 |
End bp | 4701933 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641294949 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_001560923 |
Protein GI | 160881955 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00408] prolyl-tRNA synthetase, family I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00668476 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAACG ATAAGAAATT AGTTGAATCG ATTACCTCTA TGGATGAAGA TTTCGCTCAA TGGTATACGG ATGTAGTAAA AAAAGCTGAA CTTGTAGATT ATTCAGGAGT AAGAGGATGT ACTATTTTCC GTCCAGCAGG ATATGCTATT TGGGAGAATA TCCAAAAGGA GTTAGATGCT AGATTTAAGG CGACTGGAGT AGAGAATGTT TACATGCCTA TGTTTATTCC AGAGAGCTTA TTAAATAAAG AAAAAGATCA TGTAGAGGGT TTTGCACCAG AAGTTGCTTG GGTTACTCAC GGTGGTGGAG AGCAGTTACA AGAACGTTTA TGTGTAAGAC CAACTTCTGA AACTTTATTC TGTGACTTTT ATTCTCATAT AATTGAATCT TATCGTGATC TTCCTAAACT ATACAATCAA TGGTGTTCCG TTGTACGTTG GGAGAAAACA ACCAGACCAT TCTTACGTAC TTTAGAGTTC TTATGGCAAG AAGGACATAC AGCGCATGCA ACAGCTGAGG AAGCAGAAGA AAGAACCATT CAAATGCTCA ATCTTTATGC TGATTTCTGT GAAGAAGTGT TAGCGATTCC TATGGTTCGT GGTAGAAAGA CAGACAAAGA AAAATTCGCA GGAGCTGAGG CAACTTATAC CATCGAAGCA TTGATGCATG ACGGTAAGGC GCTTCAATCA GGAACTAGCC ATAACTTTGG AGATGGATTT GCAAAAGCCT TTAACATTCA ATATACCGAT AAAGAAAATA AACTTCAATA TGTACACCAG ACTTCTTGGG GAATGACAAC TCGTCTGATT GGTGCATTAA TTATGGTACA CGGTGATAAT AGTGGTCTTG TATTGCCACC AAGAATTGCT CCTACTCAAG TTGTTATTGT TCCAATTATG CAAAAGAAGG AAGGCGTATT AGAAAAGGCG GCAGAACTTC GTGAAAAACT TGGCGCTTTC CGTGTAAAGG TCGACGATTC TGATAAGAGC CCAGGATGGA AATTCTCTGA GCATGAGATG CGTGGTATCC CAGTGCGTGT TGAAATCGGA CCAAAGGACA TTGAGGCAAA TCAAGCAGTT CTTGTACGTC GTGATACAAG AGAGAAGACT GTAGTTTCTC TTGATGAAAT TGATACAAAG ATTGGTGAAA TTCTTGAAGC TATGCAAAAA GAAATGTTAG AGCGTGCTAG AAATCATCGT GATGCTCATA CTTACGAGGC TCATTCTACA GAAGAATTTG CAGATGTTGT TGCTAACAAG CCAGGATTTG TAAAAGCTAT GTGGTGTGGA GAACGTGCCT GCGAAGACGA AATTAAGGAA AAGACAGGTG CTACTTCACG TTGTATGCCA TTTGCACAGG AACATATTGC TGATACCTGT GTATGCTGTG GTAAGCAAGC TAAATCCTTA GTGTATTGGG GAAAAGCTTA TTAA
|
Protein sequence | MANDKKLVES ITSMDEDFAQ WYTDVVKKAE LVDYSGVRGC TIFRPAGYAI WENIQKELDA RFKATGVENV YMPMFIPESL LNKEKDHVEG FAPEVAWVTH GGGEQLQERL CVRPTSETLF CDFYSHIIES YRDLPKLYNQ WCSVVRWEKT TRPFLRTLEF LWQEGHTAHA TAEEAEERTI QMLNLYADFC EEVLAIPMVR GRKTDKEKFA GAEATYTIEA LMHDGKALQS GTSHNFGDGF AKAFNIQYTD KENKLQYVHQ TSWGMTTRLI GALIMVHGDN SGLVLPPRIA PTQVVIVPIM QKKEGVLEKA AELREKLGAF RVKVDDSDKS PGWKFSEHEM RGIPVRVEIG PKDIEANQAV LVRRDTREKT VVSLDEIDTK IGEILEAMQK EMLERARNHR DAHTYEAHST EEFADVVANK PGFVKAMWCG ERACEDEIKE KTGATSRCMP FAQEHIADTC VCCGKQAKSL VYWGKAY
|
| |