Gene Information |
Locus tag | EcSMS35_1719 |
Symbol | |
ID | 6145277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1726867 |
End bp | 1727928 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616595 |
Product | hypothetical protein |
Protein accession | YP_001743773 |
Protein GI | 170680680 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
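
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one trailing stop codon
    (both are assumptions, chosen because they agree with the record).
    """
    gene_length = end_bp - start_bp + 1  # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 1726867, End bp 1727928.
print(cds_lengths(1726867, 1727928))  # (1062, 353)
```

Under these assumptions the result matches the listed Gene Length (1062 bp) and Protein Length (353 aa).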
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0023319 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATTTAC GTCATCTGTT TTCATCGCGC CTGCGTGGTT CATTACTGTT AGGTTCATTG CTTGTTGCTT CATCATTCAG TACGCAGGCC GCAGAAGAAA TGCTGCGTAA AGCGGTAGGT AAAGGTGCCT ACGAAATGGC TTATAGCCAG CAAGAAAACG CGCTGTGGCT CGCCACTTCG CAAAGCCGCA AACTGGATAA AGGCGGCGTG GTTTATCGTC TTGATCCGGT TACTCTGGAA GTGACGCAGG CGATCCATAA CGATCTCAAG CCGTTTGGTG CCACCATCAA TAACACGACT CAGACGTTGT GGTTTGGTAA CACTGTAAAC AGCGCGGTCA CGGCGATAGA TGCCAAAACG GGCGAAGTGA AAGGACGTCT GGTGCTTGAT GAGCGTAAAC GTACCGAAGA AGTGCGTCCG TTGCAGCCGC GTGAGCTGGT AGCTGATGAT GCCACGAACA CCGTTTACAT CAGTGGTATT GGTAAAGAGA GCGTGATTTG GGTCGTTGAT GGCGAGAATA TCAAACTGAA AACCGCCATC CAGAACACCG GTAAAATGAG TACCGGTTTA GCGCTGGATA GCAAAGGCAA ACGTCTTTAC ACCACTAACG CTGACGGCGA ATTGATTACC ATCGACACCG CCGACAATAA AATCCTCAGC CGTAAAAAGC TGCTGGATGA CGACAAAGAG CACTTCTTTA TCAACATCAG CCTTGATACC ACCAATCAGC GTGCATTTAT CACCGATTCT AAAGCGGCAG AAGTGTTAGT GGTTGATACC CGTAATGGCA ATATTCTTGC GAAGGTTGCG GCACCGGAAT CACTGGCTGT GCTGTTTAAC CCAGCGCGTA ATGAAGCCTA CGTGACGCAT CGTCAGGCAG GTAAAGTCAG TGTGATTGAC GCGAAAAGCT ATAAAGTGGT GAAAACGTTC GATACGCCGA CTCATCCGAA CAGCCTGGCG CTGTCTGCAG ATGGCAAAAC GCTGTATGTC AGTGTGAAAC AAAAATCCAC TAAACAGCAG GAAGCTACCC AGCCGGACGA TGTGATTCGT ATTGCGCTGT AA
|
Protein sequence | MHLRHLFSSR LRGSLLLGSL LVASSFSTQA AEEMLRKAVG KGAYEMAYSQ QENALWLATS QSRKLDKGGV VYRLDPVTLE VTQAIHNDLK PFGATINNTT QTLWFGNTVN SAVTAIDAKT GEVKGRLVLD ERKRTEEVRP LQPRELVADD ATNTVYISGI GKESVIWVVD GENIKLKTAI QNTGKMSTGL ALDSKGKRLY TTNADGELIT IDTADNKILS RKKLLDDDKE HFFINISLDT TNQRAFITDS KAAEVLVVDT RNGNILAKVA APESLAVLFN PARNEAYVTH RQAGKVSVID AKSYKVVKTF DTPTHPNSLA LSADGKTLYV SVKQKSTKQQ EATQPDDVIR IAL
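
The relationship between the gene and protein sequences above can be sketched in a few lines. The gene is on the minus strand, but the sequence as shown begins with ATG and ends with TAA, so this sketch assumes it is already given in coding orientation and needs no reverse complement. The codon assignments below follow the usual NCBI ordering for translation table 11, whose amino acid assignments are the same as the standard code. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with the amino acids
# for translation table 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence in the record.
print(translate("ATGCATTTAC GTCATCTGTT TTCATCGCGC"))  # MHLRHLFSSR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_percent` can be compared with the GC content field.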