Gene Emin_1503 details


Gene Information

Locus tag: Emin_1503
Symbol:
ID: 6263818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1595081
End bp: 1596355
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 41%
IMG OID: 642611991
Product: seryl-tRNA synthetase
Protein accession: YP_001876388
Protein GI: 187251906
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0172] Seryl-tRNA synthetase
TIGRFAM ID: [TIGR00414] seryl-tRNA synthetase
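
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the CDS ends with one untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1595081
end_bp = 1596355
gene_length_bp = 1275
protein_length_aa = 424

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1275 0 424
```

Under these assumptions the span equals the listed gene length, the length is a whole number of codons, and the derived protein length matches the listed 424 aa.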


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.000981154
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTAGACA TTAAATTATT CGCAAATGAC TTAGAAGGGG TAAAAAAATC TCTTGCTTCA 
AGAAATCCGG CCTTGCTTAC TTTAGTTGGC GAAATAACTG AGCTTAATAA AACATATAAA
GAACTTTTAA CAAGTGTTGA AAATATGAGG GCAAAAAGAA ACGAACTTTC TAAATCAGTA
GGCATTTTAA AACAAAAAGA CCCTGCCGCC GCTGAAGAAG CGATGAAAGA AGTTTCTTTG
ATTAAAACAG GTATGTCCGA AAAAGAAAAC ATGCTTGAGG AAATTAAAAA AAAGATTAGT
AATACCATGC TTAATATCCC CAACATGCCG GACCCTTCGG TAACAATAGG TAAAGATGAA
AAAGATAATA AAGAAATCCG CAAAGACGGC GCGGCGCCTT CTTTTTCCTT TAAACCTTTA
GACCACCACG CCGTGGGTGA AAAACTTGGC ATTTTAGATT TTGAAACGGC TGCTGTATTA
TCCGGAAGCC GCTTTGCCCT TTTAAAAGGT GACGGAGCCA GGCTTGAACG CGCTTTAATA
AGTTATATGT TAGATAAGCA CGCAAAAAAA GGATATACCG AAGTTGTGCC GCCTTCAATA
GTTAATGAAG AAATTTTAAT AGGCACCGGA CAGCTGCCTA AGTTTAGGGA AGATATGTAC
GCCCTTGAAG GCGAGCCTAA ACAATTTTTA ATTTCCACCG CAGAGATTCC TTTAACCAAC
ATGAACAGGG GTAAAGTTTT GCAGGAAGCC GAACTTCCCG TAAAGCTTAC TGCAGGCACG
CCCTGTTTTA GGAAAGAGGC GGGTACCTAC GGCAAAGACA CCAGAGGACT TATACGAAAC
CACCAGTTTG ATAAAGTTGA GCTTGTTATG ATATCAAACT CAAACGATTC TTTTAACCTG
TTAGAAGCAA TGACTGCCGA CGCTGAGGAT GTTTTAAAAG GCTTAGGCCT AGCTTACCGC
ACTGTTGAAC TTTGCACGGG GGATATGGGG TTTAGCTCGG CAAAAACATA TGATATAGAA
GTTTGGATGC CAAGCGAAAA CAAATACCGC GAAATTTCTT CCTGCTCGAA TTGCACTTCT
TTCCAAGCGC GCAGAATGAA TTTACGCTAT AAAAATGCCG AAGGTAAAAT AGAGTTTGTC
CATACGCTTA ACGGAAGCGG CGTAGCCGTA GGCCGCGCTC TTGCCGCTAT CCTTGAAAAC
TACCAGCAGG AAGACGGCTC CGTAATAGTG CCTGAAGCAT TAAGACCTTA TTTCGGTAAA
GATATAATAA AATAA
 
Protein sequence
MLDIKLFAND LEGVKKSLAS RNPALLTLVG EITELNKTYK ELLTSVENMR AKRNELSKSV 
GILKQKDPAA AEEAMKEVSL IKTGMSEKEN MLEEIKKKIS NTMLNIPNMP DPSVTIGKDE
KDNKEIRKDG AAPSFSFKPL DHHAVGEKLG ILDFETAAVL SGSRFALLKG DGARLERALI
SYMLDKHAKK GYTEVVPPSI VNEEILIGTG QLPKFREDMY ALEGEPKQFL ISTAEIPLTN
MNRGKVLQEA ELPVKLTAGT PCFRKEAGTY GKDTRGLIRN HQFDKVELVM ISNSNDSFNL
LEAMTADAED VLKGLGLAYR TVELCTGDMG FSSAKTYDIE VWMPSENKYR EISSCSNCTS
FQARRMNLRY KNAEGKIEFV HTLNGSGVAV GRALAAILEN YQQEDGSVIV PEALRPYFGK
DIIK
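
The gene and protein sequences above are printed in space-separated blocks of ten. The following is a minimal sketch, not part of the original record, of how one might strip that layout, translate the gene, and compute its GC fraction. It assumes the codon assignments of NCBI translation table 11, which are the same as the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGTTAGACA TTAAATTATT CGCAAATGAC"))  # MLDIKLFAND
print(translate("GATATAATAA AATAA"))                  # DIIK
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field of the record.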