Gene EcSMS35_1152 details


Gene Information

Locus tag: EcSMS35_1152
Symbol:
ID: 6143440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1174838
End bp: 1177279
Gene Length: 2442 bp
Protein Length: 813 aa
Translation table: 11
GC content: 49%
IMG OID: 641616030
Product: exonuclease family protein
Protein accession: YP_001743217
Protein GI: 170684076
COG category:
COG ID:
TIGRFAM ID:
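
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1174838
end_bp = 1177279

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2442
print(protein_length)  # 813
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.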


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000192952
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 2.23987e-16
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCACTG ATAAAGAAGA ATTTGCACTA TATTGCGAAG CAAAAAATGA CAAAGTAAGA 
AAACGCCTGG GAATTAAAGG TGGTTTTTAC TGGACTACAG CAAAAAAATT ATCTGTTGCA
ATCTCCCGCT GCATTACCGC AATGGATGAC AACGATTATG ATGAAGACGA CTTTAAAAAA
CCCGTCCGCG TCAATTTGCC CGTTGTTGAC GACCTTCCGC CAGAAGGCGT GTTTGATACT
GAATTCTGCA ACCGCTATGA AAAAGGCGGG AAAGATGGCA TCACAATGAC ATTTATCGGC
CCTTCCCCCT CTGTTCAGGA CAAACCAGCC AGCACTGACA ATACCAACAT CAACGGCGAA
GACATGACTG AGATTGAGGA GAGCATGCTT CTGCCTGTCT CCGGTCAGGA ACTGCCCATT
CGTTGGCTTG CTCAACACGG CAGCGAAAAA CCAGTAACGC ACGTTTCACG CGACGAACTC
CAGGCATTAC ACATTGCACG GGCTGAAGAA CTACCGGCTG TTACTGCCCT GGCTATTTCG
CATAAAACCA GTCTGCTCGA CTCGCTGGAG ATTCGCGACC TCCACAAACT GGTTCGTGAC
ACTGACAAAG TTTTCCCTAA TCCTGGTAAT TCAGACCTGG GACTAATAAC TGCTTTTTTC
GAAGCATACC TGGACGCTGA CTACACTGAT CGGGGTCTGC TGACAAAAGA GTGGATGAAA
GGAAATCGTG TTTCACGCAT CACCCGCACG GCTTCCGGTG CTAATGCTGG CGGTGGGAAC
AAAACCGATC GCAATCCGAA TTTAGTACAC ACCCTCGACA CACTGGATGT GGAGATTGCA
GCAGCCACAC TTCCGATGGA TTTTAATATT TATGAAATTC CGGGCAGCGT TTATCGTCGC
GCAAAAGAAG TAGTCCTGAA CAAAGAAAGT CCGTTCAAAG AATGGTCCGC AGCACTTCGT
GCAACCCCGG GTATTCTGGA CTATTCCCGC GCCGCTATTT TTGCACTTAT CCGAAGCGCA
CACCCTGAAT TTTATCACTA CCCGGGACGC CTTCAGGGGT ATATCAACGC CTATTTGACG
GAAACTGATC ACGAGAACCC CAGCAAGGAA ACTCTCACAG CTGCCCGGCA TACGCCGGAA
AAAGATATCC TGGAAGAAAT TAACCGCGAG GTGGTTACTG AGCGTGAAAC AGAAGAAGAA
AAACCACAAC CATCTGACGC AATGGCAGGT GAACAGGCAA CAACTGAAAC AATGGAACCG
GATACAACTG AACATGGCCA GAACGCGCAG TCGCTGGATG CTCAGTCGCA GGTGAGTTCC
GCTAACCAAG TAAAAGTCAC CGCTGACGAA GTAAACAAAA TTATGCAGGC AGCCAATATC
AGCCAGCCTG ACGCCGATAA GTTACTTGCT GTATCGCGTG GTGAATTTGT TGAGGGGATT
AGCGACCCTA ATGATCCGAA ATGGGTCAAG GGGATCCAGA CTCGCGATTC TGTGAACCAG
AACCAGCATG AATCGGAACG GAACGACCAA AAAGCGGAAC AAAACAGCCC AAATGCGTTA
CAAAACGAGC CAGAAACGAA ACAATCCGAA CCAGTAGCGC AACAGGAACC GGAAAAAGTC
TGCACCGCCT GCGGTCAGAG CGGTGGCGGC AACTGCCCTG ATTGTGGCGC GGTGATGGGC
GACGCAACAT ACCAGGAAAC ATTCGATGAA GAGAATCAGG TTGAAGTTCA GGAAAATGAT
CCGGAGGAAA TGGAAGGCGC TGAACATCCA CACAAGGAGA ACCCTGGCGG CAATCAGCAT
CACGCCAGCG ATAATAAAAC TGGCGAGACG GCAGATCACC CAATTAAGGT GAACGGTCAT
CACGAAATCA CATCCACCAG CAGGACGTGT GACCATCTAA TGATCGACCT TGAAACCATG
GGAAAAAATC CTGATGCCCC GATCATCTCA ATAGGTGCAA TATTTTTCGA TCCGCAAACC
GGAGATATGG GACCGGAATT TAGTAAGACT ATCGATCTGG AAACTGCTGG CGGAGTCATT
GATCGGGACA CCATTAAATG GTGGCTTAAG CAATCACGCG AAGCGCAATC TGCCATTATG
ACCGATGAAA TCCCGTTAGA TGATGCACTG TTACAATTGC GGGAATTTAT CGACGAAAAC
TCCGGTGAAT TTTTTGTTCA GGTTTGGGGA AATGGAGCCA ACTTCGACAA CACGATTTTG
CGCCGTTCAT ACGAACGGCA GGGGATCCCC TGCCCGTGGC GTTACTACAA CGATCGCGAT
GTACGCACAA TCGTTGAGCT GGGGAAAGCC ATAGACTTCG ATGCCAGAAC GGCTATTCCA
TTCGAAGGTG AGCGCCATAA TGCACTTGAT GACGCTCGTT ACCAGGCAAA ATACGTTTCA
GCTATCTGGC AAAAACTGAT CCCGAGTCAG GCTGATTTTT AA
 
Protein sequence
MSTDKEEFAL YCEAKNDKVR KRLGIKGGFY WTTAKKLSVA ISRCITAMDD NDYDEDDFKK 
PVRVNLPVVD DLPPEGVFDT EFCNRYEKGG KDGITMTFIG PSPSVQDKPA STDNTNINGE
DMTEIEESML LPVSGQELPI RWLAQHGSEK PVTHVSRDEL QALHIARAEE LPAVTALAIS
HKTSLLDSLE IRDLHKLVRD TDKVFPNPGN SDLGLITAFF EAYLDADYTD RGLLTKEWMK
GNRVSRITRT ASGANAGGGN KTDRNPNLVH TLDTLDVEIA AATLPMDFNI YEIPGSVYRR
AKEVVLNKES PFKEWSAALR ATPGILDYSR AAIFALIRSA HPEFYHYPGR LQGYINAYLT
ETDHENPSKE TLTAARHTPE KDILEEINRE VVTERETEEE KPQPSDAMAG EQATTETMEP
DTTEHGQNAQ SLDAQSQVSS ANQVKVTADE VNKIMQAANI SQPDADKLLA VSRGEFVEGI
SDPNDPKWVK GIQTRDSVNQ NQHESERNDQ KAEQNSPNAL QNEPETKQSE PVAQQEPEKV
CTACGQSGGG NCPDCGAVMG DATYQETFDE ENQVEVQEND PEEMEGAEHP HKENPGGNQH
HASDNKTGET ADHPIKVNGH HEITSTSRTC DHLMIDLETM GKNPDAPIIS IGAIFFDPQT
GDMGPEFSKT IDLETAGGVI DRDTIKWWLK QSREAQSAIM TDEIPLDDAL LQLREFIDEN
SGEFFVQVWG NGANFDNTIL RRSYERQGIP CPWRYYNDRD VRTIVELGKA IDFDARTAIP
FEGERHNALD DARYQAKYVS AIWQKLIPSQ ADF
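
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table by hand rather than using a library. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as start codons), so the standard amino acid string is used here. The function and variable names are hypothetical, and only the first three and the last two 10-base blocks of the gene sequence are translated.

```python
# Codon table in the conventional T, C, A, G base order.
# Assumption: translation table 11 shares the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_bases = "ATGAGCACTG ATAAAGAAGA ATTTGCACTA"
last_bases = "GCTGATTTTT AA"

print(translate(first_bases))  # MSTDKEEFAL
print(translate(last_bases))   # ADF
```

The two outputs match the first ten residues and the last three residues of the protein sequence shown above.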