Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1152 |
Symbol | |
ID | 6143440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1174838 |
End bp | 1177279 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616030 |
Product | exonuclease family protein |
Protein accession | YP_001743217 |
Protein GI | 170684076 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000192952 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 2.23987e-16 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCACTG ATAAAGAAGA ATTTGCACTA TATTGCGAAG CAAAAAATGA CAAAGTAAGA AAACGCCTGG GAATTAAAGG TGGTTTTTAC TGGACTACAG CAAAAAAATT ATCTGTTGCA ATCTCCCGCT GCATTACCGC AATGGATGAC AACGATTATG ATGAAGACGA CTTTAAAAAA CCCGTCCGCG TCAATTTGCC CGTTGTTGAC GACCTTCCGC CAGAAGGCGT GTTTGATACT GAATTCTGCA ACCGCTATGA AAAAGGCGGG AAAGATGGCA TCACAATGAC ATTTATCGGC CCTTCCCCCT CTGTTCAGGA CAAACCAGCC AGCACTGACA ATACCAACAT CAACGGCGAA GACATGACTG AGATTGAGGA GAGCATGCTT CTGCCTGTCT CCGGTCAGGA ACTGCCCATT CGTTGGCTTG CTCAACACGG CAGCGAAAAA CCAGTAACGC ACGTTTCACG CGACGAACTC CAGGCATTAC ACATTGCACG GGCTGAAGAA CTACCGGCTG TTACTGCCCT GGCTATTTCG CATAAAACCA GTCTGCTCGA CTCGCTGGAG ATTCGCGACC TCCACAAACT GGTTCGTGAC ACTGACAAAG TTTTCCCTAA TCCTGGTAAT TCAGACCTGG GACTAATAAC TGCTTTTTTC GAAGCATACC TGGACGCTGA CTACACTGAT CGGGGTCTGC TGACAAAAGA GTGGATGAAA GGAAATCGTG TTTCACGCAT CACCCGCACG GCTTCCGGTG CTAATGCTGG CGGTGGGAAC AAAACCGATC GCAATCCGAA TTTAGTACAC ACCCTCGACA CACTGGATGT GGAGATTGCA GCAGCCACAC TTCCGATGGA TTTTAATATT TATGAAATTC CGGGCAGCGT TTATCGTCGC GCAAAAGAAG TAGTCCTGAA CAAAGAAAGT CCGTTCAAAG AATGGTCCGC AGCACTTCGT GCAACCCCGG GTATTCTGGA CTATTCCCGC GCCGCTATTT TTGCACTTAT CCGAAGCGCA CACCCTGAAT TTTATCACTA CCCGGGACGC CTTCAGGGGT ATATCAACGC CTATTTGACG GAAACTGATC ACGAGAACCC CAGCAAGGAA ACTCTCACAG CTGCCCGGCA TACGCCGGAA AAAGATATCC TGGAAGAAAT TAACCGCGAG GTGGTTACTG AGCGTGAAAC AGAAGAAGAA AAACCACAAC CATCTGACGC AATGGCAGGT GAACAGGCAA CAACTGAAAC AATGGAACCG GATACAACTG AACATGGCCA GAACGCGCAG TCGCTGGATG CTCAGTCGCA GGTGAGTTCC GCTAACCAAG TAAAAGTCAC CGCTGACGAA GTAAACAAAA TTATGCAGGC AGCCAATATC AGCCAGCCTG ACGCCGATAA GTTACTTGCT GTATCGCGTG GTGAATTTGT TGAGGGGATT AGCGACCCTA ATGATCCGAA ATGGGTCAAG GGGATCCAGA CTCGCGATTC TGTGAACCAG AACCAGCATG AATCGGAACG GAACGACCAA AAAGCGGAAC AAAACAGCCC AAATGCGTTA CAAAACGAGC CAGAAACGAA ACAATCCGAA CCAGTAGCGC AACAGGAACC GGAAAAAGTC TGCACCGCCT GCGGTCAGAG CGGTGGCGGC AACTGCCCTG ATTGTGGCGC GGTGATGGGC GACGCAACAT ACCAGGAAAC ATTCGATGAA GAGAATCAGG TTGAAGTTCA GGAAAATGAT CCGGAGGAAA TGGAAGGCGC TGAACATCCA CACAAGGAGA ACCCTGGCGG CAATCAGCAT CACGCCAGCG ATAATAAAAC TGGCGAGACG GCAGATCACC CAATTAAGGT GAACGGTCAT CACGAAATCA CATCCACCAG CAGGACGTGT GACCATCTAA TGATCGACCT TGAAACCATG GGAAAAAATC CTGATGCCCC GATCATCTCA ATAGGTGCAA TATTTTTCGA TCCGCAAACC GGAGATATGG GACCGGAATT TAGTAAGACT ATCGATCTGG AAACTGCTGG CGGAGTCATT GATCGGGACA CCATTAAATG GTGGCTTAAG CAATCACGCG AAGCGCAATC TGCCATTATG ACCGATGAAA TCCCGTTAGA TGATGCACTG TTACAATTGC GGGAATTTAT CGACGAAAAC TCCGGTGAAT TTTTTGTTCA GGTTTGGGGA AATGGAGCCA ACTTCGACAA CACGATTTTG CGCCGTTCAT ACGAACGGCA GGGGATCCCC TGCCCGTGGC GTTACTACAA CGATCGCGAT GTACGCACAA TCGTTGAGCT GGGGAAAGCC ATAGACTTCG ATGCCAGAAC GGCTATTCCA TTCGAAGGTG AGCGCCATAA TGCACTTGAT GACGCTCGTT ACCAGGCAAA ATACGTTTCA GCTATCTGGC AAAAACTGAT CCCGAGTCAG GCTGATTTTT AA
|
Protein sequence | MSTDKEEFAL YCEAKNDKVR KRLGIKGGFY WTTAKKLSVA ISRCITAMDD NDYDEDDFKK PVRVNLPVVD DLPPEGVFDT EFCNRYEKGG KDGITMTFIG PSPSVQDKPA STDNTNINGE DMTEIEESML LPVSGQELPI RWLAQHGSEK PVTHVSRDEL QALHIARAEE LPAVTALAIS HKTSLLDSLE IRDLHKLVRD TDKVFPNPGN SDLGLITAFF EAYLDADYTD RGLLTKEWMK GNRVSRITRT ASGANAGGGN KTDRNPNLVH TLDTLDVEIA AATLPMDFNI YEIPGSVYRR AKEVVLNKES PFKEWSAALR ATPGILDYSR AAIFALIRSA HPEFYHYPGR LQGYINAYLT ETDHENPSKE TLTAARHTPE KDILEEINRE VVTERETEEE KPQPSDAMAG EQATTETMEP DTTEHGQNAQ SLDAQSQVSS ANQVKVTADE VNKIMQAANI SQPDADKLLA VSRGEFVEGI SDPNDPKWVK GIQTRDSVNQ NQHESERNDQ KAEQNSPNAL QNEPETKQSE PVAQQEPEKV CTACGQSGGG NCPDCGAVMG DATYQETFDE ENQVEVQEND PEEMEGAEHP HKENPGGNQH HASDNKTGET ADHPIKVNGH HEITSTSRTC DHLMIDLETM GKNPDAPIIS IGAIFFDPQT GDMGPEFSKT IDLETAGGVI DRDTIKWWLK QSREAQSAIM TDEIPLDDAL LQLREFIDEN SGEFFVQVWG NGANFDNTIL RRSYERQGIP CPWRYYNDRD VRTIVELGKA IDFDARTAIP FEGERHNALD DARYQAKYVS AIWQKLIPSQ ADF
|
| |