Gene Acid345_1063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1063 
Symbol 
ID4068712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1329744 
End bp1331507 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content58% 
IMG OID637983071 
Productprolyl-tRNA synthetase 
Protein accessionYP_590140 
Protein GI94968092 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.408216 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.634833 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCGCT GGTCTAAGCT CTTCATCCCT ACTCTGCGCG AGGCGCCTGC CGACGCCGAA 
GTCGCCAGCC ATAAGTTCCT CGTTCGGGCC GGATACATCC GCCAATTAGC CGCAGGCATT
TACTCGTACT TGTTCCTCGG GAACCGCTCG ATGAACAAGA TCATCGGTAT CGTCCGCGAG
GAGATGGACA AGATCGGCCA GGAATATTAC CTGCCGGCGT TAAATCCCCG TGAGATATGG
GAAGCCAGCG GGCGCTGGGC GGTGATGGGC GACAACATGT TCCGCCTCAA GGACCGTAAG
GGGGCGGAGC TTTGCCTCGG CATGACCCAC GAAGAGATCA TGACCGAGAT CGCGCGCAAG
GAACTGCGCA GCTACAAGCA GTTGCCGCAG ATCTGGTACC AGATCCAGAC CAAGTTTCGC
GACGAGCCTC GTCCGAGGTC GGGACTGCTG CGCGTGCGCC AGTTCATCAT GAAGGATTCG
TACTCGTTCG ACATCGACGC CGCGGGTCTG GACATCAGCT ACCAGAAGCA CCATGACGCG
TACTGCCGCA TCTTCGACCG TTGCGGGTTG AAGTACGTCG TTGTACACGC GGATTCCGGC
GCGATGGGCG GCTCTGGGTC TCAGGAATTC ATGGTCTACA CCGACGCCGG TGAAGACCTC
GTTGCGAGCT GCGCGAACTG CAGTTACGCC GCGAATGTGG AGAAAGCCAC ATCGAAGCTG
GAAGCCATCG AGGACCTCGT TGCTACGGCC GATACGCCCG AACTCGTTCA TACGCCCGGG
CAGAAGACGA TTGAGCAAGT CGCCGCGTAC CTTGGCGTCT CACCGAAGAA CAAGATCAAG
ACGCTTGCCT ACATGATGGC CGCTCCCAAG GGAGCCAAGG ACGGAAGAGA GCAGGCCCTC
GTCGTGCTCC TGCGAGGCGA CCATATGCTC AACGAGGCAA AGCTCGGCGC TGCAATCAAG
GGGCGCGAAG TTCGTCCGAT GACGGAGGAA GAAATCCAGG ATCTGTTCCA TTCCCCTGCC
GGATACCTTG GACCTCTCAA TGTCGAATGG GCCAAGACCT CCGAAGATAC CGAGAAACCT
TTGCTGCTTT TGGATGAGGC ACTGGTCAGT CGCAAGAACC TGATCGCAGG CGCGAACAAA
GAGGAGTATC ACGTTCGTAA CCTCACCCCG GGCGAAAGCT TCCAGTTCAC CGGCTCGGCT
GACCTGCGGA TGGTCGCGGA AGGCGAGCCT TGCCCGAACT GCGGACACGC CTTGAAGGTG
GGCAAGACGG TCGAGATCGG CCACATCTTC AAGCTCGGCT ACAAGTACAC GGACGCTATG
GGTGCCCGCG TTCTCGACAA GGATGGCAAG GAAGTCATGC CGATCATGGG CAGCTACGGC
ATTGGCATGG AGCGCATCTT GACGGCGTCA GTCGAGCAGT CCAACGACGA TAACGGCTTC
TGGCTGCCTG CCCAGATCGC CCCGTTCGAA GTCGTTGTTA CCCCAACCAA CGTCAGCGAC
GAAAAGCTGG CGAAAGGGGC TGAGGAGATC GCTGCAAAGC TGGAGGCTGC AGGGTTTGAC
GTCATCCTGG ACGACCGCGA CGAGCGGCCG GGTGTGAAGT TCAAGGATGC AGACTTGGTG
GGTATCCCCG TCCGGATAAA CGTGGGAAAG AAGTTCGTGG AGGGCAAAGT TGAGGTAATT
CACCGCTCGA CACGTGAGTC GCTCGATGCT ACGATTCCGG AAATCGTTGA AAAGATAGCG
GCTTGGTTGA AACCGAGTGC TTAA
 
Protein sequence
MHRWSKLFIP TLREAPADAE VASHKFLVRA GYIRQLAAGI YSYLFLGNRS MNKIIGIVRE 
EMDKIGQEYY LPALNPREIW EASGRWAVMG DNMFRLKDRK GAELCLGMTH EEIMTEIARK
ELRSYKQLPQ IWYQIQTKFR DEPRPRSGLL RVRQFIMKDS YSFDIDAAGL DISYQKHHDA
YCRIFDRCGL KYVVVHADSG AMGGSGSQEF MVYTDAGEDL VASCANCSYA ANVEKATSKL
EAIEDLVATA DTPELVHTPG QKTIEQVAAY LGVSPKNKIK TLAYMMAAPK GAKDGREQAL
VVLLRGDHML NEAKLGAAIK GREVRPMTEE EIQDLFHSPA GYLGPLNVEW AKTSEDTEKP
LLLLDEALVS RKNLIAGANK EEYHVRNLTP GESFQFTGSA DLRMVAEGEP CPNCGHALKV
GKTVEIGHIF KLGYKYTDAM GARVLDKDGK EVMPIMGSYG IGMERILTAS VEQSNDDNGF
WLPAQIAPFE VVVTPTNVSD EKLAKGAEEI AAKLEAAGFD VILDDRDERP GVKFKDADLV
GIPVRINVGK KFVEGKVEVI HRSTRESLDA TIPEIVEKIA AWLKPSA