Gene GM21_1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1334 
Symbol 
ID8136661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1567439 
End bp1569160 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content62% 
IMG OID644868948 
Productprolyl-tRNA synthetase 
Protein accessionYP_003021152 
Protein GI253699963 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.0000102845 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTTATT CCCAGTACTT TATCCCGACT GTCAAGGAGA CTCCCTCCGA CGCGGAGGTC 
ATCTCCCATA AGTTGATGCT GCGCGCCGGC ATGATCAGGA AACTCGCCGC CGGTATCTAC
AACTACCTCC CGTTCGGGCT TCGTTCCATC CGTAAGGTCG AGGCCATCGT CCGCGAGGAG
ATGAACCGCG CTGGCGCCAT CGAGCTCCTG ATGCCCGCCG TCCAGCCTGC CGAACTCTGG
AAGGAGTCGG GGCGCTGGGA ATTCTACGGC AAGGAACTGC TCCGCTTCAA CGACAGAAAG
GACGCCGAGT TCTGCATGGG CCCCACCCAC GAGGAGGTCA TCACCGACCT CATCCGCAAG
GAAGTCCGCA GCTACCGGCA ATTGCCGATC AACCTGTACC AGATCCAGGG CAAGTTCCGC
GACGAGATCC GCCCCCGCTT CGGCCTGATG CGCGGCCGCG AGTTCATCAT GAAGGACGCC
TACTCCTTCG ACGTGAACGA GGCCGGCGCC GACGTCTCCT ACGAGAAGAT GTACAAGGCC
TACCGCCGCA TCTTCGAGCG CTGCGGCCTG AAGTTCCGCG CCGTCGAGGC CGACACCGGC
ACCATCGGCG GGAACTACTC CCACGAGTTC ATGGTGCTCG CCGACTCCGG CGAGGACGCC
ATCGTCTCCT GCTCTGCCTG CGAGTACGCC GCCAACATGG AGAAGGCTGA GACCCGTAAA
GGTGAGGGGA TCGAGCATGC CGACCCGCGT CCGATGGAGC ACGTCAGCAC CCCGGGGCAG
AAGAGCATCG AGGACGTGGC AGCCTTCCTC GGCGTGCAGA ACACCCAGGT CGTGAAGACG
CTGGTGCTGG TCGCCGACGG CGAGCCGGTC GTGGCCCTTA TCCGCGGCGA CTATGACCTG
AACGAGATCA AGCTGAAAAA CCACCTGGGG TGCGCGGAGC TTGAGATGGC CGAGGACGAC
GTGGTCGTCA AGGTCACCGG CGCACCCACC GGCTACGCTG GCCCCGTGGG GCTCGCGGCC
AAGGTGAAGG TCGTAGCCGA CCTCTCCCTG GAGGGGATGC ACAACTTCGT CACCGGCGCC
AACGCAGCCG ACACCCACCT GAAAAACGTG AACATCGGGC GCGACTTCAG CGTCAGCGGC
TTCGTCGACA TCAGGAACGT CGTCATCGGC GACGCCTGCC CGCGCTGTGA CAGCGGGAAG
CTGGAGATCT GGCGCGGCAT CGAGGTCGGT CACGTCTTCA AGCTCGGCAC CAAGTACTCC
AAGGCCCTCA AGGCCACCTT CCTCGATGCC GACGGCAAAG AGCAGACCAT CTTCATGGGA
TGCTACGGCA TCGGCGTCGG GCGCACCGTC GCCGCCTGCA TAGAGCAGAA CCACGATGAG
AACGGCATCA TCTTCCCGAT TCCCATCGCG CCGTTCCAGT GCATCATCTC CTCGCTCAGC
GTGAAAGAGG ACGAGGTCAA GGCTGCCTCC GAGTCCATCT ATCAGGAGCT TCTGGAAGCG
GGCATAGAAG TGCTTCTCGA CGACCGCGAC GAGCGTCCAG GCTTCAAGTT CAAAGACGCC
GACCTGATCG GGATTCCGCT GCGCATCGTC GTGGGCGCGA AGGCCCTGGC GGAAGGCAAA
GTCGAACTGA AAGAGAGAAG AAGCGGCGAA GTAGAGGTTC TCCCCATCGC CGAAGCCATA
GCCAAGGTAA AAGCCGCCGT CAAAGAGGCG CTGCAGGTAT AA
 
Protein sequence
MRYSQYFIPT VKETPSDAEV ISHKLMLRAG MIRKLAAGIY NYLPFGLRSI RKVEAIVREE 
MNRAGAIELL MPAVQPAELW KESGRWEFYG KELLRFNDRK DAEFCMGPTH EEVITDLIRK
EVRSYRQLPI NLYQIQGKFR DEIRPRFGLM RGREFIMKDA YSFDVNEAGA DVSYEKMYKA
YRRIFERCGL KFRAVEADTG TIGGNYSHEF MVLADSGEDA IVSCSACEYA ANMEKAETRK
GEGIEHADPR PMEHVSTPGQ KSIEDVAAFL GVQNTQVVKT LVLVADGEPV VALIRGDYDL
NEIKLKNHLG CAELEMAEDD VVVKVTGAPT GYAGPVGLAA KVKVVADLSL EGMHNFVTGA
NAADTHLKNV NIGRDFSVSG FVDIRNVVIG DACPRCDSGK LEIWRGIEVG HVFKLGTKYS
KALKATFLDA DGKEQTIFMG CYGIGVGRTV AACIEQNHDE NGIIFPIPIA PFQCIISSLS
VKEDEVKAAS ESIYQELLEA GIEVLLDDRD ERPGFKFKDA DLIGIPLRIV VGAKALAEGK
VELKERRSGE VEVLPIAEAI AKVKAAVKEA LQV