Gene EcSMS35_1920 details

Gene Information

Locus tag: EcSMS35_1920
Symbol:
ID: 6146894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1942450
End bp: 1943844
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 51%
IMG OID: 641616796
Product: hypothetical protein
Protein accession: YP_001743972
Protein GI: 170683203
COG category:
COG ID:
TIGRFAM ID:
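
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1942450, 1943844)
print(length)                     # 1395
print(protein_length_aa(length))  # 464
```

Both results match the Gene Length and Protein Length fields listed in the record.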


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.10975
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCCGTT TCGTTCCTCG CATTATTCCG TTTTATTTAC TCTTGCTGGC GGCAGGCGGT 
ACAGCTAACG CACAATCTAC CTTCGAGCAA AAAGCGGCAA ATCCCTTTGA TAATAACAAT
GATGGTCTGC CGGATTTAGG CATGGCACCT GAAAATCATG ATGGGGAAAA ACACTTTGCG
GAAATTGTGA AAGATTTCGG CGAAACCAGT ATGAATGATA ACGGGCTGGA TACTGGCGAG
CAGGCAAAAG CTTTCGCATT AGGAAAAGTC CGCGACGCGC TTAGTCAACA GGTTAATCAG
CACGTAGAGT CCTGGCTATC ACCGTGGGGA AATGCCAGTG TTGACGTCAA AGTGGATAAC
GAAGGACATT TCACCGGCAG TCGCGGAAGC TGGTTTGTGC CGTTACAAGA TAATGATCGT
TATCTCACCT GGAGCCAGCT TGGTCTTACT CAGCAGGATG ATGGCTTGGT GAGCAATGTG
GGCGTAGGGC AACGCTGGGC GCGCGGCAAC TGGCTGGTGG GTTATAACAC TTTTTATGAC
AACTTGCTGG ACGAAAATCT TCAGCGAGCG GGCTTTGGTG CCGAAGCGTG GGGCGAATAT
TTGCGACTAT CGGCAAACTT TTATCAGCCG TTTGCTGCAT GGCATGAACA GACAGCCACG
CAGGAACAGC GGATGGCGCG CGGGTACGAC CTGACAGCCC GGATGCGCAT GCCGTTCTAT
CAACACCTCA ATACCAGTGT CAGCGTAGAA CAGTATTTTG GTGATCGTGT CGATTTGTTT
AACTCTGGTA CGGGTTATCA CAATCCCGTC GCGTTGAGTC TGGGATTAAA TTACACCCCT
GTGCCCTTAG TCACTGTGAC GGCCCGGCAT AAACAGGGTG AAAGTGGCGA GAATCAAAAT
AACCTCGGGC TGAATCTTAA CTACCGCTTT GGTGTACCGC TCAAAAAACA ACTTTCTGCG
GGCGAAGTTG CCGAAAGTCA GTCGTTACGT GGTAGTCGCT ATGACAATCC GCAGCGAAAT
AATCTTCCGA CTCTTGAGTA CCGACAGCGA AAAACCTTAA CGGTGTTTCT GGCGACACCG
CCGTGGGATC TAAAACCTGG CGAAACAGTG CCGCTGAAAT TACAAATCCG CAGTCGTTAC
GGTATTCGGC AACTGATTTG GCAGGGCGAT ACGCAGATAT TAAGTTTGAC GCCGGGCGCA
CAAGCCAACA GTGAGGAGGG CTGGACGCTG ATCATGCCTG ACTGGCAAAA CGGGGAAGGC
GCAAGCAATC ACTGGCGATT GTCAGTAGTG GTGGAAGATA ACCAGGGGCA GCGTGTCTCC
TCCAATGAGA TCACGCTAAC GCTTGTCGAA CCGTTCGACG CATTGTCAAA CGACGAACTG
CGCTGGGAAC CGTAA
 
Protein sequence
MSRFVPRIIP FYLLLLAAGG TANAQSTFEQ KAANPFDNNN DGLPDLGMAP ENHDGEKHFA 
EIVKDFGETS MNDNGLDTGE QAKAFALGKV RDALSQQVNQ HVESWLSPWG NASVDVKVDN
EGHFTGSRGS WFVPLQDNDR YLTWSQLGLT QQDDGLVSNV GVGQRWARGN WLVGYNTFYD
NLLDENLQRA GFGAEAWGEY LRLSANFYQP FAAWHEQTAT QEQRMARGYD LTARMRMPFY
QHLNTSVSVE QYFGDRVDLF NSGTGYHNPV ALSLGLNYTP VPLVTVTARH KQGESGENQN
NLGLNLNYRF GVPLKKQLSA GEVAESQSLR GSRYDNPQRN NLPTLEYRQR KTLTVFLATP
PWDLKPGETV PLKLQIRSRY GIRQLIWQGD TQILSLTPGA QANSEEGWTL IMPDWQNGEG
ASNHWRLSVV VEDNQGQRVS SNEITLTLVE PFDALSNDEL RWEP
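
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it builds the standard codon-to-residue mapping (which translation table 11 shares for elongation codons), reads an initial ATG, GTG, or TTG as methionine (a simplification that covers only the most common bacterial start codons), and drops a trailing stop codon. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for all 64 codons in TCAG order, third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplification: only the most common bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading an initiator codon as Met."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    if residues and residues[-1] == "*":
        residues.pop()  # the stop codon is not part of the protein
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGAGCCGTT TCGTTCCTCG CATTATTCCG"))  # MSRFVPRIIP
```

The printed residues match the first ten residues of the protein sequence, including the TTG start codon being read as M. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.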