Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1334 |
Symbol | |
ID | 8136661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1567439 |
End bp | 1569160 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644868948 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_003021152 |
Protein GI | 253699963 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0000102845 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTTATT CCCAGTACTT TATCCCGACT GTCAAGGAGA CTCCCTCCGA CGCGGAGGTC ATCTCCCATA AGTTGATGCT GCGCGCCGGC ATGATCAGGA AACTCGCCGC CGGTATCTAC AACTACCTCC CGTTCGGGCT TCGTTCCATC CGTAAGGTCG AGGCCATCGT CCGCGAGGAG ATGAACCGCG CTGGCGCCAT CGAGCTCCTG ATGCCCGCCG TCCAGCCTGC CGAACTCTGG AAGGAGTCGG GGCGCTGGGA ATTCTACGGC AAGGAACTGC TCCGCTTCAA CGACAGAAAG GACGCCGAGT TCTGCATGGG CCCCACCCAC GAGGAGGTCA TCACCGACCT CATCCGCAAG GAAGTCCGCA GCTACCGGCA ATTGCCGATC AACCTGTACC AGATCCAGGG CAAGTTCCGC GACGAGATCC GCCCCCGCTT CGGCCTGATG CGCGGCCGCG AGTTCATCAT GAAGGACGCC TACTCCTTCG ACGTGAACGA GGCCGGCGCC GACGTCTCCT ACGAGAAGAT GTACAAGGCC TACCGCCGCA TCTTCGAGCG CTGCGGCCTG AAGTTCCGCG CCGTCGAGGC CGACACCGGC ACCATCGGCG GGAACTACTC CCACGAGTTC ATGGTGCTCG CCGACTCCGG CGAGGACGCC ATCGTCTCCT GCTCTGCCTG CGAGTACGCC GCCAACATGG AGAAGGCTGA GACCCGTAAA GGTGAGGGGA TCGAGCATGC CGACCCGCGT CCGATGGAGC ACGTCAGCAC CCCGGGGCAG AAGAGCATCG AGGACGTGGC AGCCTTCCTC GGCGTGCAGA ACACCCAGGT CGTGAAGACG CTGGTGCTGG TCGCCGACGG CGAGCCGGTC GTGGCCCTTA TCCGCGGCGA CTATGACCTG AACGAGATCA AGCTGAAAAA CCACCTGGGG TGCGCGGAGC TTGAGATGGC CGAGGACGAC GTGGTCGTCA AGGTCACCGG CGCACCCACC GGCTACGCTG GCCCCGTGGG GCTCGCGGCC AAGGTGAAGG TCGTAGCCGA CCTCTCCCTG GAGGGGATGC ACAACTTCGT CACCGGCGCC AACGCAGCCG ACACCCACCT GAAAAACGTG AACATCGGGC GCGACTTCAG CGTCAGCGGC TTCGTCGACA TCAGGAACGT CGTCATCGGC GACGCCTGCC CGCGCTGTGA CAGCGGGAAG CTGGAGATCT GGCGCGGCAT CGAGGTCGGT CACGTCTTCA AGCTCGGCAC CAAGTACTCC AAGGCCCTCA AGGCCACCTT CCTCGATGCC GACGGCAAAG AGCAGACCAT CTTCATGGGA TGCTACGGCA TCGGCGTCGG GCGCACCGTC GCCGCCTGCA TAGAGCAGAA CCACGATGAG AACGGCATCA TCTTCCCGAT TCCCATCGCG CCGTTCCAGT GCATCATCTC CTCGCTCAGC GTGAAAGAGG ACGAGGTCAA GGCTGCCTCC GAGTCCATCT ATCAGGAGCT TCTGGAAGCG GGCATAGAAG TGCTTCTCGA CGACCGCGAC GAGCGTCCAG GCTTCAAGTT CAAAGACGCC GACCTGATCG GGATTCCGCT GCGCATCGTC GTGGGCGCGA AGGCCCTGGC GGAAGGCAAA GTCGAACTGA AAGAGAGAAG AAGCGGCGAA GTAGAGGTTC TCCCCATCGC CGAAGCCATA GCCAAGGTAA AAGCCGCCGT CAAAGAGGCG CTGCAGGTAT AA
|
Protein sequence | MRYSQYFIPT VKETPSDAEV ISHKLMLRAG MIRKLAAGIY NYLPFGLRSI RKVEAIVREE MNRAGAIELL MPAVQPAELW KESGRWEFYG KELLRFNDRK DAEFCMGPTH EEVITDLIRK EVRSYRQLPI NLYQIQGKFR DEIRPRFGLM RGREFIMKDA YSFDVNEAGA DVSYEKMYKA YRRIFERCGL KFRAVEADTG TIGGNYSHEF MVLADSGEDA IVSCSACEYA ANMEKAETRK GEGIEHADPR PMEHVSTPGQ KSIEDVAAFL GVQNTQVVKT LVLVADGEPV VALIRGDYDL NEIKLKNHLG CAELEMAEDD VVVKVTGAPT GYAGPVGLAA KVKVVADLSL EGMHNFVTGA NAADTHLKNV NIGRDFSVSG FVDIRNVVIG DACPRCDSGK LEIWRGIEVG HVFKLGTKYS KALKATFLDA DGKEQTIFMG CYGIGVGRTV AACIEQNHDE NGIIFPIPIA PFQCIISSLS VKEDEVKAAS ESIYQELLEA GIEVLLDDRD ERPGFKFKDA DLIGIPLRIV VGAKALAEGK VELKERRSGE VEVLPIAEAI AKVKAAVKEA LQV
|
| |