Gene Cphy_3837 details


Gene Information

Locus tag: Cphy_3837
Symbol:
ID: 5744789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4700500
End bp: 4701933
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 40%
IMG OID: 641294949
Product: prolyl-tRNA synthetase
Protein accession: YP_001560923
Protein GI: 160881955
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00408] prolyl-tRNA synthetase, family I
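
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that Gene Length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4700500
END_BP = 4701933
GENE_LENGTH_BP = 1434
PROTEIN_LENGTH_AA = 477


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assertion failed for another record, the likely causes would be a different coordinate convention or a gene length that excludes the stop codon, rather than an error in the sequence itself.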


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00668476
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAACG ATAAGAAATT AGTTGAATCG ATTACCTCTA TGGATGAAGA TTTCGCTCAA 
TGGTATACGG ATGTAGTAAA AAAAGCTGAA CTTGTAGATT ATTCAGGAGT AAGAGGATGT
ACTATTTTCC GTCCAGCAGG ATATGCTATT TGGGAGAATA TCCAAAAGGA GTTAGATGCT
AGATTTAAGG CGACTGGAGT AGAGAATGTT TACATGCCTA TGTTTATTCC AGAGAGCTTA
TTAAATAAAG AAAAAGATCA TGTAGAGGGT TTTGCACCAG AAGTTGCTTG GGTTACTCAC
GGTGGTGGAG AGCAGTTACA AGAACGTTTA TGTGTAAGAC CAACTTCTGA AACTTTATTC
TGTGACTTTT ATTCTCATAT AATTGAATCT TATCGTGATC TTCCTAAACT ATACAATCAA
TGGTGTTCCG TTGTACGTTG GGAGAAAACA ACCAGACCAT TCTTACGTAC TTTAGAGTTC
TTATGGCAAG AAGGACATAC AGCGCATGCA ACAGCTGAGG AAGCAGAAGA AAGAACCATT
CAAATGCTCA ATCTTTATGC TGATTTCTGT GAAGAAGTGT TAGCGATTCC TATGGTTCGT
GGTAGAAAGA CAGACAAAGA AAAATTCGCA GGAGCTGAGG CAACTTATAC CATCGAAGCA
TTGATGCATG ACGGTAAGGC GCTTCAATCA GGAACTAGCC ATAACTTTGG AGATGGATTT
GCAAAAGCCT TTAACATTCA ATATACCGAT AAAGAAAATA AACTTCAATA TGTACACCAG
ACTTCTTGGG GAATGACAAC TCGTCTGATT GGTGCATTAA TTATGGTACA CGGTGATAAT
AGTGGTCTTG TATTGCCACC AAGAATTGCT CCTACTCAAG TTGTTATTGT TCCAATTATG
CAAAAGAAGG AAGGCGTATT AGAAAAGGCG GCAGAACTTC GTGAAAAACT TGGCGCTTTC
CGTGTAAAGG TCGACGATTC TGATAAGAGC CCAGGATGGA AATTCTCTGA GCATGAGATG
CGTGGTATCC CAGTGCGTGT TGAAATCGGA CCAAAGGACA TTGAGGCAAA TCAAGCAGTT
CTTGTACGTC GTGATACAAG AGAGAAGACT GTAGTTTCTC TTGATGAAAT TGATACAAAG
ATTGGTGAAA TTCTTGAAGC TATGCAAAAA GAAATGTTAG AGCGTGCTAG AAATCATCGT
GATGCTCATA CTTACGAGGC TCATTCTACA GAAGAATTTG CAGATGTTGT TGCTAACAAG
CCAGGATTTG TAAAAGCTAT GTGGTGTGGA GAACGTGCCT GCGAAGACGA AATTAAGGAA
AAGACAGGTG CTACTTCACG TTGTATGCCA TTTGCACAGG AACATATTGC TGATACCTGT
GTATGCTGTG GTAAGCAAGC TAAATCCTTA GTGTATTGGG GAAAAGCTTA TTAA
 
Protein sequence
MANDKKLVES ITSMDEDFAQ WYTDVVKKAE LVDYSGVRGC TIFRPAGYAI WENIQKELDA 
RFKATGVENV YMPMFIPESL LNKEKDHVEG FAPEVAWVTH GGGEQLQERL CVRPTSETLF
CDFYSHIIES YRDLPKLYNQ WCSVVRWEKT TRPFLRTLEF LWQEGHTAHA TAEEAEERTI
QMLNLYADFC EEVLAIPMVR GRKTDKEKFA GAEATYTIEA LMHDGKALQS GTSHNFGDGF
AKAFNIQYTD KENKLQYVHQ TSWGMTTRLI GALIMVHGDN SGLVLPPRIA PTQVVIVPIM
QKKEGVLEKA AELREKLGAF RVKVDDSDKS PGWKFSEHEM RGIPVRVEIG PKDIEANQAV
LVRRDTREKT VVSLDEIDTK IGEILEAMQK EMLERARNHR DAHTYEAHST EEFADVVANK
PGFVKAMWCG ERACEDEIKE KTGATSRCMP FAQEHIADTC VCCGKQAKSL VYWGKAY
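
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is the coding strand in frame from its first base, strips the 10-base block spacing, and uses the codon assignments of NCBI translation table 11 (the same amino acid assignments as the standard code; the table's alternative start codons are not handled, which does not matter here because the gene starts with ATG). All function names are hypothetical, and only the first line of the gene sequence is included for brevity.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGGCAAACG ATAAGAAATT AGTTGAATCG ATTACCTCTA TGGATGAAGA TTTCGCTCAA"
print(translate(clean(first_line)))  # MANDKKLVESITSMDEDFAQ
```

The printed residues match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record.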