Gene EcSMS35_1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1865 
Symbol 
ID6146650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1887771 
End bp1889666 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content47% 
IMG OID641616741 
Producthypothetical protein 
Protein accessionYP_001743919 
Protein GI170681982 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000596823 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGGGA AGTTTCGCTG CATTTTGCTG TTGATAGTTG GGCTTTTTTT CTCTTCGTTG 
AGTTATGCGA AAAACACGGA GATCCCTTCT TATGAAGAAG GGATCTCTCT CTTTGATGTT
GAAGCCACTC TGCAACCGGA TGGGGTGCTC GACATCAAAG AAAATATTCA TTTTCAGGCG
CGAAATCAGC AGATTAAGCA CGGATTTTAT CGTGATTTAC CACGACTCTG GATGCAGCCT
GATGGGGACG CTGCACTGCT GAACTATCAT ATTGTTGGCG TCACCCGTGA TGGTATTCCT
GAACCCTGGC ATCTTGACTG GCATATCGGG TTAATGAGTA TTGTCGTGGG CGATAAACAA
CGTTTCTTGC CTCAAGGCGA CTATCATTAT CAAATTCATT ATCAGGTTAA AAATGCTTTC
CTGCGTGAAG GGGATTCTGA TCTGCTAATC TGGAACGTGA CCGGTAACCA CTGGCCGTTT
GAAATTTATA AGACCCTATT TTCACTCAAG TTGCCAGATA TTGCGGGTAA TCCATTTAGC
GAAATCGATC TCTTTACTGG AGAAGAGGGC GACACATATC GAAATGGCCG CATCCTTGAG
GACGGAAGAA TTGAATCCCG CGATCCGTTT TATCGTGAAG ATTTCACGGT CCTCTACCGC
TGGCCTCACG CTTTACTTGG TAATGCCCCG GCACCACAAA CGACGAATAT TTTCAGCCAT
CTTCTTTTAC CCTCTACGTC ATCGTTGTTA ATTTGGTTTC CGTGTCTCTT CCTGGCGTGT
GGATGGTTAT ATCTCTGGAA GCGCAGGCCG CAATTTACGC CGGTAGATGT GATTGAAACC
GATGTCATTC CGCCAGATTA CACACCCGGC ATGTTACGTC TCGATGCGAA GCTGGTTTAC
GACGATAAAG GTTTTTGTGC CGATATCGTA AATCTGATTG TTAAAGGAAA AATTCATCTG
GAAGATCAGT ATGACAAGAA CCAGCAAATC CTGATTTGTG TTAATGAAGG CGCGACCAGA
AATAATGCGG TATTACTGCC CGCAGAGCAG TTATTACTGG AAGCGTTATT TCGTAAAGGC
GATAAGGTCG TTCTTACGGG GAGACGCAAC AGAGTCTTAC GCAGGGCATT TTTACGGATG
CAGAAATTTT ATCTGCCGCG TAAAAAGTCT TCGTTTTATC GACCAGATAC GTTTTTGCAA
TGGGGCGGAA TGGCGATATT GGCGGTCATT CTCTACGGTA ACCTGAGTCC CGTAGGTTGG
GCAGGAATGA GTCTGGTTGG CGATATGTTT ATTATGATCT GCTGGCTTCT TCCTTTTTTA
TTTTGTTCCC TTGAGCTTTT GTTTGCCCGC GATGATGACA AGCCTTGCGT TAATCGTGTA
ATCATCACTT TGTTTTTACC GCTGATTTGT TCAGGCGTGG CCTTTTATTC TCTCTATATC
AATGTCGGAG ATGTATTCTT TTACTGGTAT ATGCCAGCGG GTTATTTTAG CGCTGTTTTC
CTGACCGGTT ATCTCACTGG CATGGGGTAT ATTTTTCTGC CAAAGTTTAC CCAAACTGGG
CAGCAACGTT ATGCCCACGG TGAAGCTATC GTTAACTATC TTGCGCGTAA AGAGGCAGCA
ACACACAGTG GGCGGCGGCG GAAAGGGGAA ACACGGAAAC TGGATTACGC GTTGCTAGGT
TGGGCAGTCT CGGCAAACCT TGGAAAAGAA TGGGCAGCAC GTATCACCCC ATCACTCACA
GCGGCTGTTC ACGCCCCGGA AATTGCCCGT AGTGGCGTTT TGTTTTCATT ACAGATGCAC
CTGAGCCTGG GGGCCAATAC CAGTTTGTTG GGGCGAAGTT ATTCCGGTGG TGGTGCTGGC
GGCGGGGCGG GTGGCGGAGG CGGTGGTGGC TGGTAA
 
Protein sequence
MAGKFRCILL LIVGLFFSSL SYAKNTEIPS YEEGISLFDV EATLQPDGVL DIKENIHFQA 
RNQQIKHGFY RDLPRLWMQP DGDAALLNYH IVGVTRDGIP EPWHLDWHIG LMSIVVGDKQ
RFLPQGDYHY QIHYQVKNAF LREGDSDLLI WNVTGNHWPF EIYKTLFSLK LPDIAGNPFS
EIDLFTGEEG DTYRNGRILE DGRIESRDPF YREDFTVLYR WPHALLGNAP APQTTNIFSH
LLLPSTSSLL IWFPCLFLAC GWLYLWKRRP QFTPVDVIET DVIPPDYTPG MLRLDAKLVY
DDKGFCADIV NLIVKGKIHL EDQYDKNQQI LICVNEGATR NNAVLLPAEQ LLLEALFRKG
DKVVLTGRRN RVLRRAFLRM QKFYLPRKKS SFYRPDTFLQ WGGMAILAVI LYGNLSPVGW
AGMSLVGDMF IMICWLLPFL FCSLELLFAR DDDKPCVNRV IITLFLPLIC SGVAFYSLYI
NVGDVFFYWY MPAGYFSAVF LTGYLTGMGY IFLPKFTQTG QQRYAHGEAI VNYLARKEAA
THSGRRRKGE TRKLDYALLG WAVSANLGKE WAARITPSLT AAVHAPEIAR SGVLFSLQMH
LSLGANTSLL GRSYSGGGAG GGAGGGGGGG W