Gene EcolC_3855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3855 
Symbol 
ID6067544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4213456 
End bp4214433 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content52% 
IMG OID641603270 
Productlysyl-tRNA synthetase 
Protein accessionYP_001726786 
Protein GI170021832 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2269] Truncated, possibly inactive, lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00462] lysyl-tRNA synthetase-like protein GenX 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00133285 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGAAA CGGCATCCTG GCAGCCGAGC GCATCCATTC CTAACTTATT AAAACGCGCG 
GCGATTATGG CGGAGATCCG TCGTTTCTTT GCCGATCGTG GAGTGCTGGA GGTGGAGACG
CCTTGTATGA GCCAGGCGAC GGTAACCGAT ATTCATTTGG TCCCGTTTGA GACACGTTTC
GTTGGCCCCG GGCATTCGCA GGGGATGAAT CTCTGGTTAA TGACCAGCCC GGAATACCAT
ATGAAACGCC TGCTGGTTGC CGGTTGTGGG CCGGTATTCC AGCTGTGCCG CAGTTTCCGT
AATGAAGAGA TGGGGCGTTA TCACAACCCT GAGTTCACTA TGCTGGAGTG GTATCGACCG
CACTATGATA TGTACCGGTT GATGAACGAG GTGGACGATC TCTTACAACA GGTGCTGGAC
TGCCCTGCAG CAGAAAGCCT TTCTTATCAA CAAGCTTTCT TGCGTTATCT GGAAATTGAC
CCACTCTCTG CCGACAAAAC GCAACTGCGG GAAGTGGCAG CGAAACTGGA TTTGAGCAAT
GTTGCTGATA CCGAAGAAGA CCGCGACACG TTGCTACAAT TGCTGTTTAC CTTTGGCGTA
GAGCCAAATA TTGGTAAAGA AAAACCGACC TTTGTGTACC ACTTTCCAGC CAGCCAGGCA
TCACTGGCGC AAATCAGTAC CGAAGATCAT CGGGTCGCTG AACGCTTTGA GGTTTATTAT
AAAGGTATTG AGCTGGCGAA TGGTTTCCAT GAATTGACGG ATGCCCGTGA GCAGCAACAA
CGCTTTGAAC AAGATAACCG TAAGCGCGCG GCGCGCGGTT TGCCGCAGCA CCCCATTGAC
CAGAATCTGA TTGACGCCTT GAAAGTCGGT ATGCCTGACT GTTCCGGCGT GGCATTAGGT
GTTGATCGTC TGGTGATGTT GGCGCTGGGC GCGGAGACAC TGGCTGAAGT CATCGCCTTT
AGCGTTGACC GGGCATAA
 
Protein sequence
MSETASWQPS ASIPNLLKRA AIMAEIRRFF ADRGVLEVET PCMSQATVTD IHLVPFETRF 
VGPGHSQGMN LWLMTSPEYH MKRLLVAGCG PVFQLCRSFR NEEMGRYHNP EFTMLEWYRP
HYDMYRLMNE VDDLLQQVLD CPAAESLSYQ QAFLRYLEID PLSADKTQLR EVAAKLDLSN
VADTEEDRDT LLQLLFTFGV EPNIGKEKPT FVYHFPASQA SLAQISTEDH RVAERFEVYY
KGIELANGFH ELTDAREQQQ RFEQDNRKRA ARGLPQHPID QNLIDALKVG MPDCSGVALG
VDRLVMLALG AETLAEVIAF SVDRA