Gene Emin_0684 details


Gene Information

Locus tag: Emin_0684
Symbol:
ID: 6263247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 757783
End bp: 759504
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 44%
IMG OID: 642611156
Product: prolyl-tRNA synthetase
Protein accession: YP_001875576
Protein GI: 187251094
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
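
The coordinate and length fields in this record agree with one another: the span from 757783 to 759504 covers 1722 bp, which is 574 codons, and removing the stop codon leaves 573 residues. The following is a minimal Python sketch of that arithmetic, plus a helper for recomputing GC content from a sequence string. The function names are illustrative assumptions and are not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    print(span_length(757783, 759504))    # 1722
    print(expected_protein_length(1722))  # 573
```

Passing the full gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the GC content field.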


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000975412
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000000032501
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAATTAA CACAGTATTA CCTGCCCACT TTAAAAGAAG CGCCTAAAGA CGCGGACACA 
ATTTCCGCCA AACTTATGTT AAGAGCGGGC CTTATACGCA AAACCGCCAG CGGTATTTAT
GAATGGTTGC CCTTAGGTTT AAAGGTGCTT AAAAAAGTTG AGCAGATAGT GCGTGAGGAA
ATGGACGCCG CCGGTGCGCA TGAAGTTTGG CTTCCTTTAA TACAGCCCAA AGAACTTTGG
GAAGAAAGCG GCCGCTGGAC ATATTACGGC AAAGAACTTT TAAGAATTAA AGACCGCAAA
GGGGCGGAAT TTTGTTTTGC TCCCACCGCC GAGGAAGTTA TTACCGATGT TGTAAGAAGA
GACGTTACCT CCTATAAACA ATTACCAGTG GCTTTATACC AGTTTGCCTC TAAATTTAGA
GATGAAATCA GGCCCCGCTT CGGCGTTATG AGAGCCAGAG AGTTTTATAT GAAAGACGCC
TACTCTTTCC ACGCTACGGA AGAATCAATT AATGAATGGT ATCTCAAATT TTTTGAAGCT
TACAAGAAAG TCTGCACACG CTGCGGTTTT AAATTTAAAG CGGTTGAGGC CGATACCGGC
GCGATAGGCG GCAATTTTTC GCACGAGTTT ATGGTTCTTG CCGACACGGG TGAAAACGAA
ATAGCAGACT GTGACTGCGG TTATGCCGCC AACACCGAAA AGGCCGAAAT TTTTAAACCG
AAATTTCCCG CCGCTAAAGA AGAATTAAAA ACTATTGAAA AAGTTAACAC CCCCAACGCC
ACAACCATTG AAGACGTTGC CAAAATGCTT GGACAAACGG CAGATAGGTT TATTAAACTT
TTGGTTTTTA CGGCCGACGG ACAGCCCGTT GTCGCTTTAA TGCGCGGCGA CCATGAACTT
AACGAGCACA AATTAAAAGC CTTATTAAAA GCGCAGGAGC TTGAAAAAGC TAATGAAGAA
ACCTACGCCA AGGTAACAGG CTCTTTTGTT GGTTACGCGG GGCCCGTGGG TTTGAAAGAA
AAGAACCCTA AAATCAAATT GTTTGCCGAT TACCATGTGG CAGGCATTGT TAACGGTATC
GCGGGCGGTA ACGAGAAAGA CGTTCACATT ATTAATGTTA CCCCCTCGCG CGACTTTACG
CCTGACGTTT ATGCCGATTT AAAAATCGCC TCCGAAGGTG ATTTATGCGG CAAATGCGGC
AAAAAGTTTA ATTTTACAAG AGGCATTGAA GTAGGCCACA CGTTTAAACT GGGTACAAAA
TATTCCCAGT CCATGAAGGC CGAGTTTTTG GACGAAAACC AAAAATCACA CCCTTTCTTA
ATGGGCTGCT ACGGCATAGG AATAAGCCGC ATCGTGGCCG CGGCTATTGA GCAAAGCCAC
GACGAAAACG GCATTATCTG GCCCGCTCCT TTAGCGCCTT TTGATATTTA TTTAGTTTCA
ATTGACACCG ATATAAACCC TAAAGTTAAA GAAGAAACAG ACAGTATTTA TAATCAGCTT
ACACAAGCGG GGTTAAACGT TTTGCTCGAC GACCGCAACG AAAGGCCCGG CATTAAATTT
AAAGACGCCG ACCTCATAGG CCTGCCACAT AGAATTGTAA TAAGCAGCCG CACGGTTGAA
ACGGGTGAAT ACGAGTATAA ACAAAGAACT TCAAAAGAAG CTATAAGACG CAAACTGGCA
GACATATCTG AACAGATTAA AGAATTTCAG GCAAGCAAGT AA
 
Protein sequence
MKLTQYYLPT LKEAPKDADT ISAKLMLRAG LIRKTASGIY EWLPLGLKVL KKVEQIVREE 
MDAAGAHEVW LPLIQPKELW EESGRWTYYG KELLRIKDRK GAEFCFAPTA EEVITDVVRR
DVTSYKQLPV ALYQFASKFR DEIRPRFGVM RAREFYMKDA YSFHATEESI NEWYLKFFEA
YKKVCTRCGF KFKAVEADTG AIGGNFSHEF MVLADTGENE IADCDCGYAA NTEKAEIFKP
KFPAAKEELK TIEKVNTPNA TTIEDVAKML GQTADRFIKL LVFTADGQPV VALMRGDHEL
NEHKLKALLK AQELEKANEE TYAKVTGSFV GYAGPVGLKE KNPKIKLFAD YHVAGIVNGI
AGGNEKDVHI INVTPSRDFT PDVYADLKIA SEGDLCGKCG KKFNFTRGIE VGHTFKLGTK
YSQSMKAEFL DENQKSHPFL MGCYGIGISR IVAAAIEQSH DENGIIWPAP LAPFDIYLVS
IDTDINPKVK EETDSIYNQL TQAGLNVLLD DRNERPGIKF KDADLIGLPH RIVISSRTVE
TGEYEYKQRT SKEAIRRKLA DISEQIKEFQ ASK
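
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch under stated assumptions: it uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares, and it does not handle the alternative start codons that table 11 also allows. The names `CODON_TABLE` and `translate` are illustrative and do not come from the source database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same amino-acid assignments as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, with the record's block spacing.
    print(translate("ATGAAATTAA CACAGTATTA CCTGCCCACT"))  # MKLTQYYLPT
```

The first ten residues produced this way match the start of the protein sequence listed above. Pasting the whole gene sequence into `translate` allows the same comparison for all 573 residues.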