Gene Information |
Locus tag | EcSMS35_3708 |
Symbol | |
ID | 6147059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3771868 |
End bp | 3772860 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641618534 |
Product | hypothetical protein |
Protein accession | YP_001745674 |
Protein GI | 170683549 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | |
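The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(3771868, 3772860)
print(length)                  # 993
print(protein_length(length))  # 330
```

Both results match the Gene Length (993 bp) and Protein Length (330 aa) fields.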
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.371578 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATAAAT TCACCGGTGT TTTACTATTA GGTACGGCGC TACTGGCGGG ATGTGTCGAC CGGGAAGGGT ACTATAACAG CGTCAGGGAA GAAGACAGCC ATGGACTGAC GTCTCTGCGG GGGCAACCTG AATTACGTTA CAACGATGAT TGGTCAAGAT GGCCGAGAGT GTACGGCGCT ACAGCCTTAT ACCCGCTGTA TGCCTCCGCG TATTATGAAT TAGTACCCGA GCCAAAAGAT AAGGATCGAA CCTCGCTGGC CTGGCAGGCG TATGGTTTGC AACAAACCCG AACAGCTGAA GCCTACGATA GTCTGATTAA GGGTACCGCG ACGGTTATTT TTGTTGCACA ACCGTCGGAA GGACAGAAAA AACGTGCAGA AGAAGCAGGT GTTAAACTCA AATATACCGC TTTCGCCCGC GAAGCCTTTG TCTTTATCGT TGATATCAAA AACCCGGTTA ATTCACTTAG CGAACAGCAG GTCAAAGACA TTTTTAGCGG TAAAGTGAGT CGCTGGAATA AAGTGGGCGG CGGTGACGAA AGTATAAAAG TCTGGCAGCG GCCAGAAGAT TCTGGAAGCC AAACGGTTAT GAAGGGACTG GTTATGCAAG ATACTCCAAT GCTGCCAGCC AAAAAATCCA CTGTTATCGA TCTTATGGGC GGTTTAATTA CCGAAGTCGC CGACTACCAA AATACACCAT CTTCTATTGG ATACACCTTC CACTATTACG TCACACGTAT GAATGACAAT ATGCTCAAAA TGCGCAAACA GATCAAACTG CTGGCTATAA ATGGCGTTGC ACCTACAGAG GAAAATATCC GTAACGGCAC TTATCCGTAC ATAATTCATG CTTATATGGT GACGCACGAA AACCCTACGC TGGAAACGCA GAAATTCGTC GACTGGTTTT TAAGCCCGCA GGGACAGCAA TTGGTAGAGG ATGTGGGCTA TGTGCCGATT TATGACGCAT CATCCGAATC ATCAGGACAA TAA
Protein sequence | MNKFTGVLLL GTALLAGCVD REGYYNSVRE EDSHGLTSLR GQPELRYNDD WSRWPRVYGA TALYPLYASA YYELVPEPKD KDRTSLAWQA YGLQQTRTAE AYDSLIKGTA TVIFVAQPSE GQKKRAEEAG VKLKYTAFAR EAFVFIVDIK NPVNSLSEQQ VKDIFSGKVS RWNKVGGGDE SIKVWQRPED SGSQTVMKGL VMQDTPMLPA KKSTVIDLMG GLITEVADYQ NTPSSIGYTF HYYVTRMNDN MLKMRKQIKL LAINGVAPTE ENIRNGTYPY IIHAYMVTHE NPTLETQKFV DWFLSPQGQQ LVEDVGYVPI YDASSESSGQ
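The sequences above are shown in space-separated blocks of 10 characters. A minimal sketch of how the GC content and the translation could be recomputed from such a string is given below. It assumes the gene sequence is already the coding strand (it starts with ATG and ends with TAA as displayed) and uses the NCBI codon ordering for translation table 11, whose amino acid assignments are the same as the standard code. The helper names are hypothetical, and only the first 30 bases of the gene are used in the example.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base
# each cycling through T, C, A, G). Table 11 shares these assignments
# with the standard code; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAATAAAT TCACCGGTGT TTTACTATTA"
print(translate(prefix))  # MNKFTGVLLL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be compared in the same way.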