Gene EcSMS35_0702 details

Gene Information

Locus tag: EcSMS35_0702
Symbol: glnS
ID: 6146950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 705473
End bp: 707137
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 53%
IMG OID: 641615592
Product: glutaminyl-tRNA synthetase
Protein accession: YP_001742791
Protein GI: 170681669
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0008] Glutamyl- and glutaminyl-tRNA synthetases
TIGRFAM ID: [TIGR00440] glutaminyl-tRNA synthetase
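
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record above.
gene_length, protein_length = cds_lengths(705473, 707137)
print(gene_length, protein_length)
```

With the values from this record the sketch gives 1665 bp and 554 aa, matching the Gene Length and Protein Length fields.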


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.237561
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGG CAGAAGCCCG CCCGACTAAC TTTATTCGTC AGATCATCGA TGAAGATCTG 
GCCAGTGGTA AGCACACCAC AGTACACACC CGTTTCCCGC CGGAGCCGAA TGGCTATCTG
CATATTGGCC ATGCGAAATC TATCTGCCTG AACTTCGGGA TCGCCCAGGA CTATAAAGGC
CAGTGCAACC TGCGTTTCGA CGACACTAAC CCGGTAAAAG AAGATATCGA GTATGTTGAG
TCGATCAAAA ACGACGTTGA GTGGTTAGGT TTTCACTGGT CTGGTAACGT CCGTTACTCC
TCCGATTATT TTGATCAGCT CCACGCCTAT GCGATCGAAC TGATCAATAA AGGCCTGGCG
TACGTTGATG AACTGACACC GGAACAGATC CGCGAATACC GTGGCACCCT GACGCAGCCG
GGTAAAAACA GCCCGTACCG CGATCGCAGC GTTGAAGAGA ACCTGGCGCT GTTCGAGAAA
ATGCGTACCG GCGGTTTTGA AGAAGGTAAA GCCTGCCTGC GTGCGAAGAT CGACATGGCT
TCGCCGTTTA TCGTGATGCG CGATCCGGTG CTGTACCGCA TTAAGTTTGC TGAACACCAC
CAGACTGGCA ACAAGTGGTG CATCTACCCG ATGTACGACT TTACCCACTG CATCAGCGAT
GCGCTGGAAG GTATTACGCA CTCTCTGTGT ACGCTTGAGT TCCAGGACAA CCGTCGCCTG
TACGACTGGG TGCTGGACAA CATCACCATT CCTGTTCACC CGCGCCAGTA CGAATTCTCG
CGCCTGAATC TGGAATACAC TGTGATGTCC AAGCGTAAGC TGAACCTGCT GGTGACCGAC
AAGCACGTTG AAGGCTGGGA TGACCCGCGT ATGCCGACCA TTTCCGGTCT GCGTCGTCGT
GGTTACACTG CAGCTTCTAT TCGTGAGTTC TGCAAACGCA TCGGCGTGAC CAAGCAGGAC
AACACCATTG AGATGGCGTC GCTGGAATCC TGCATCCGTG AAGATCTCAA CGAAAATGCG
CCGCGCGCAA TGGCGGTTAT CGATCCGGTG AAACTGGTTA TCGAAAACTA CCAGGGCGAA
GGCGAAATGG TTACCATGCC GAACCATCCG AATAAACCGG AAATGGGCAG CCGCCAGGTG
CCATTTAGCG GTGAGATTTG GATCGATCGC GCCGATTTCC GCGAAGAAGC TAACAAGCAG
TACAAACGTC TGGTGCTGGG TAAAGAAGTG CGTCTGCGTA ATGCTTACGT CATTAAGGCA
GAACGCGTGG AGAAAGATGC CGAAGGCAAT ATCACCACCA TCTTCTGTAC TTATGACGCC
GACACCTTAA GCAAAGATCC GGCAGATGGT CGTAAAGTCA AAGGTGTTAT TCACTGGGTG
AGCGCGGCAC ATGCGCTGCC GGTTGAAATC CGTTTGTATG ACCGTCTGTT CAGCGTGCCG
AACCCTGGTG CTGCGGATGA TTTCCTGTCG GTGATTAACC CGGAATCGCT GGTGATCAAA
CAGGGCTTTG CTGAACCGTC GCTGAAAGAT GCGGTTGCGG GTAAAGCATT CCAGTTTGAG
CGTGAAGGTT ACTTCTGCCT CGACAGCCGC CATTCTACGG CGGAAAAACC GGTATTTAAC
CGCACCGTTG GGCTGCGTGA TACCTGGGCG AAAGTAGGCG AGTAA
 
Protein sequence
MSEAEARPTN FIRQIIDEDL ASGKHTTVHT RFPPEPNGYL HIGHAKSICL NFGIAQDYKG 
QCNLRFDDTN PVKEDIEYVE SIKNDVEWLG FHWSGNVRYS SDYFDQLHAY AIELINKGLA
YVDELTPEQI REYRGTLTQP GKNSPYRDRS VEENLALFEK MRTGGFEEGK ACLRAKIDMA
SPFIVMRDPV LYRIKFAEHH QTGNKWCIYP MYDFTHCISD ALEGITHSLC TLEFQDNRRL
YDWVLDNITI PVHPRQYEFS RLNLEYTVMS KRKLNLLVTD KHVEGWDDPR MPTISGLRRR
GYTAASIREF CKRIGVTKQD NTIEMASLES CIREDLNENA PRAMAVIDPV KLVIENYQGE
GEMVTMPNHP NKPEMGSRQV PFSGEIWIDR ADFREEANKQ YKRLVLGKEV RLRNAYVIKA
ERVEKDAEGN ITTIFCTYDA DTLSKDPADG RKVKGVIHWV SAAHALPVEI RLYDRLFSVP
NPGAADDFLS VINPESLVIK QGFAEPSLKD AVAGKAFQFE REGYFCLDSR HSTAEKPVFN
RTVGLRDTWA KVGE
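
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds the codon table used by translation table 11, whose amino acid assignments are the same as the standard code (table 11 differs in which start codons it permits), and translates the first 30 and last 15 bases of the gene sequence shown above. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any particular library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence in this record.
first_30 = "ATGAGTGAGG CAGAAGCCCG CCCGACTAAC"
last_15 = "AAAGTAGGCG AGTAA"

print(translate(first_30))  # MSEAEARPTN
print(translate(last_15))   # KVGE
```

These two fragments reproduce the first ten and last four residues of the protein sequence above. Passing the full gene sequence to the same function would be the natural way to check the whole 554-residue product.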