Gene EcSMS35_4626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4626 
SymbolgenX 
ID6144552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4727292 
End bp4728269 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content52% 
IMG OID641619442 
Productlysyl-tRNA synthetase 
Protein accessionYP_001746553 
Protein GI170684036 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2269] Truncated, possibly inactive, lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00462] lysyl-tRNA synthetase-like protein GenX 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0839984 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAA CGGCATCCTG GCAGCCGAGC GCATCCATTC CTAACTTATT AAAACGCGCG 
GCGATTATGG CGGAGATCCG TCGTTTCTTT GCCGATCGTG GAGTGCTGGA GGTGGAGACG
CCTTGTATGA GCCAGGCGAC GGTAACCGAT ATTCATTTGG TCCCGTTTGA GACACGTTTC
GTTGGCCCCG GGCATTCGCA GGGGATGAAT CTCTGGTTAA TGACCAGCCC TGAATACCAT
ATGAAACGCC TGCTGGTTGC TGGTTGTGGG CCGGTATTCC AGCTGTGCCG CAGCTTCCGT
AATGAAGAGA TGGGGCGTTA TCACAACCCT GAGTTCACTA TGCTTGAGTG GTATCGACCG
CACTATGATA TGTACCGGTT GATGAACGAG GTGGACGATC TCTTACAACA AGTGCTGGAC
TGCCCGGCAG CAGAAAGCCT TTCTTATCAA CAAGCTTTCT TGCGTTATCT GGAAATTGAC
CCGCTCTCTG CCGACAAAAC GCAACTGCGG GAAGTGGCAG CGAAACTGGA TTTGAGCAAC
GTTGCAGATA CCGAAGAAGA CCGCGACACG TTGCTACAAT TGCTGTTTAC CTTTGGCGTA
GAGCCAAATA TTGGCAAAGA AAAACCGACC TTTGTGTACC ACTTTCCAGC CAGCCAGGCA
TCACTGGCGC AAATCAGTAC CGAAGATCAT CGGGTCGCTG AACGCTTTGA GGTTTATTAT
AAAGGTATTG AGCTGGCGAA TGGTTTCCAT GAATTGACGG ATGCCCGTGA GCAGCAACAA
CGCTTTGAAC AAGATAACCG TAAGCGCGCG GCGCGCGGTT TGCCGCAGCA CCCCATTGAC
CAGAATCTGA TTGAAGCCTT GAAAGTCGGT ATGCCTGACT GTTCCGGCGT GGCATTAGGC
GTTGATCGTC TGGTGATGTT GGCGCTGGGC GCGGAGACAC TGGCTGAAGT CATCGCCTTT
AGCGTTGACC GGGCATAA
 
Protein sequence
MSETASWQPS ASIPNLLKRA AIMAEIRRFF ADRGVLEVET PCMSQATVTD IHLVPFETRF 
VGPGHSQGMN LWLMTSPEYH MKRLLVAGCG PVFQLCRSFR NEEMGRYHNP EFTMLEWYRP
HYDMYRLMNE VDDLLQQVLD CPAAESLSYQ QAFLRYLEID PLSADKTQLR EVAAKLDLSN
VADTEEDRDT LLQLLFTFGV EPNIGKEKPT FVYHFPASQA SLAQISTEDH RVAERFEVYY
KGIELANGFH ELTDAREQQQ RFEQDNRKRA ARGLPQHPID QNLIEALKVG MPDCSGVALG
VDRLVMLALG AETLAEVIAF SVDRA