Gene EcSMS35_3918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3918 
SymbolselB 
ID6144997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3989693 
End bp3991537 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content57% 
IMG OID641618744 
Productselenocysteinyl-tRNA-specific translation factor 
Protein accessionYP_001745883 
Protein GI170681383 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG3276] Selenocysteine-specific translation elongation factor 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00475] selenocysteine-specific elongation factor SelB
[TIGR00485] translation elongation factor TU 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.696319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATTG CGACTGCCGG ACACGTTGAC CACGGCAAAA CAACCTTGTT GCAGGCGATT 
ACGGGCGTAA ATGCCGACCG TCTGCCGGAA GAAAAAAAGC GCGGCATGAC CATCGATCTC
GGCTATGCCT ACTGGCCGCA GCCGGATGGT CGCGTGCCTG GTTTTATCGA CGTACCCGGT
CATGAAAAGT TTCTTTCCAA CATGCTGGCG GGCGTTGGTG GTATTGATCA CGCGCTGTTG
GTGGTGGCGT GCGATGACGG CGTGATGGCA CAGACCCGTG AGCATCTGGC GATTTTGCAG
CTGACCGGTA ACCCGATGCT GACAGTGGCG CTGACCAAAG CCGATCGCGT GGACGAAGCG
CGTGTTGATG AGGTTGAACG CCAGGTAAAG GAGGTGCTGC GGGAATACGG TTTTGCTGAG
GCAAAACTGT TTATCACCGC AGCAACCGAA GGTCGGGGAA TTGATGCCCT GCGCGAGCAT
CTGCTTCAGT TACCAGAACG CGAACACGCC AGCCAACATA GTTTCCGCCT CGCGATCGAC
CGCGCATTTA CCGTAAAAGG TGCCGGGCTG GTCGTTACCG GTACGGCGTT AAGCGGGGAA
GTGAAGGTAG GCGATTCAAT CTGGCTGACT GGTGTAAATA AACCGATGCG CGTGCGTGCG
CTACATGCGC AAAATCAGCC AACAGAAACT GCCCATGCCG GGCAGCGTAT CGCGCTTAAC
ATCGCGGGTG ATGCGGAAAA AGAGCAGATT AACCGTGGCG ACTGGCTGCT TGCCGATGTG
CCGCCGGAGC CGTTCACACG GGTGATTGTC GAGCTTCAAA CCCATACACC GCTGACCCAG
TGGCAGCCGC TGCATATTCA CCACGCCGCC AGCCACGTTA CGGGACGCGT TTCGCTGCTG
GAAGATAACC TTGCCGAGCT GGTGTTCGAC ACGCCGCTGT GGCTGGCGGA TAACGATCGC
ATAGTGCTGC GCGATATCTC GGCCCGCAAC ACGCTGGCGG GGGCGCGTGT GGTGATGCTT
AACCCGCCGC GTCGCGGTAA GCGTAAGCCG GAATATCTGC AATGGCTGGC GTCTCTTGCA
CGGGCGCAGA GCGATGCCGA TGCGTTATCT GTTCATCTGG AACGCGGCGC GGTTAACCTT
GCTGATTTCG CCTGGGCGCG CCAGCTCAAC GGCGAAGGGA TGCGCGAACT GCTGCAACAG
CCTGGCTATA TTCAGGCTGG TTATAGCTTG TTGAATGCGC CGGTTGCCGC CCGCTGGCAG
CGGAAAATTC TCGACACATT AGCGACTTAT CATGAGCAAC ATCGCGATGA ACCTGGTCCT
GGGCGCGAAC GTCTGCGACG TATGGCGTTG CCAATGGAAG ATGAAGCGCT GGTACTGTTG
CTGATTGAAA AGATGCGCGA AAGCGGCGAC ATCCTCAGTC ATCACGGCTG GCTGCATCTG
CCGGATCACA AAGCGGGCTT CAGCGAAGAG CAGCAGGCCA TCTGGCAAAA AGCAGAGCCA
CTGTTTGGTG ACGAACCGTG GTGGGTGCGA GACCTGGCAA AAGAGACGGG AACCGACGAG
CAGGCAATGC GCCTGACTCT ACGCCAGGCG GCGCAGCAGG GGATAATTAC CGCGATCGTT
AAAGATCGTT ATTACCGTAA CGATCGGATT GTCGAGTTTG CTAATATGAT CCGCGATCTC
GATCAGGAGT GTGGTTCAAC CTGCGCGGCG GATTTCCGCG ATCGCTTAGG CGTAGGCCGA
AAGCTGGCAA TTCAGATTCT GGAATATTTT GACCGCATTG GCTTTACGCG TCGTCGTGGA
AATGATCATT TATTACGCGA CGCATTATTA TTTCCGGAAA AATAA
 
Protein sequence
MIIATAGHVD HGKTTLLQAI TGVNADRLPE EKKRGMTIDL GYAYWPQPDG RVPGFIDVPG 
HEKFLSNMLA GVGGIDHALL VVACDDGVMA QTREHLAILQ LTGNPMLTVA LTKADRVDEA
RVDEVERQVK EVLREYGFAE AKLFITAATE GRGIDALREH LLQLPEREHA SQHSFRLAID
RAFTVKGAGL VVTGTALSGE VKVGDSIWLT GVNKPMRVRA LHAQNQPTET AHAGQRIALN
IAGDAEKEQI NRGDWLLADV PPEPFTRVIV ELQTHTPLTQ WQPLHIHHAA SHVTGRVSLL
EDNLAELVFD TPLWLADNDR IVLRDISARN TLAGARVVML NPPRRGKRKP EYLQWLASLA
RAQSDADALS VHLERGAVNL ADFAWARQLN GEGMRELLQQ PGYIQAGYSL LNAPVAARWQ
RKILDTLATY HEQHRDEPGP GRERLRRMAL PMEDEALVLL LIEKMRESGD ILSHHGWLHL
PDHKAGFSEE QQAIWQKAEP LFGDEPWWVR DLAKETGTDE QAMRLTLRQA AQQGIITAIV
KDRYYRNDRI VEFANMIRDL DQECGSTCAA DFRDRLGVGR KLAIQILEYF DRIGFTRRRG
NDHLLRDALL FPEK