Gene Information |
Locus tag | EcSMS35_3709 |
Symbol | |
ID | 6142839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3772886 |
End bp | 3774391 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641618535 |
Product | hypothetical protein |
Protein accession | YP_001745675 |
Protein GI | 170681684 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | |
|
|
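The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3772886, 3774391)
print(length)                  # 1506
print(protein_length(length))  # 501
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.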
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.447391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAACG GAAAATGGAT TTTGACCTCG CTGGTAATGA CTTTTTTTGG CATCCCCATA CTGGCGCAAT TTTTGGCAGC GGTTGTTGCC ATGCTGGGGG CCGGGCTTGC CGCTATTTTT GATGTTTGCA ATTTACTCTT TACGCCAACA ATTTATCTTC TACTCAACGT CTTTATGCTG ACGCTGGGCG CATTACTGCT ATTTTTCTCT GGGCGAGTGT GGGCGGGCGA TAGCGCACCA GAAAACAGAG AAATAGCCGC CTGGCGACAA TGTCTTTTTT TAGTTCCCGC TTTATTAACC CTGGTTGGCT GGATAATCAC GCTACATCTG GCAGATTATC AATTTCGCCA GATGGTTTCA GGTTGGTTGG CAAACCTTAT GCTTCCCTGG TTGGGCGTTT TTACAGTCTC ATTCGTCGGT GGTGAGTACT GGTGGATAGT CATTATTCCC GTTGGCGCGC ATATCAGTTT TTCACTGGGA TACGGCTGGC CGACCAGACA CCCTTTAACC GGCACCTCCG GTCTACGTTG CCGTAATTTA CTTCTGTTCA TTCTTCTCTT ACTGGGTATT GTCGCTGGTT ATCAGGCTTA TTTATATAAA CAGCTTAATC CCGGCGTCGG TGTGCGTGAA AATATTGATA CCTGGGCCTG GCGACCCGAT AAACTTAATA ATCAACTGAC GCCACTGCGT GGTAAACCAC AAATTCAATT TACGCAAAAC TGGCCGCGAC TCGATGGCGC TACGGCGGCG TACCCCATTT ATGCCTCTGC ATTTTATGCA TTAAGTGTAA TACCAGAGGA TTTTCACGTT TGGGATTATC TGGATAACTC TCGTACGCAA GAAGCATATA ACAAAATCGT TAAGGGCGAT GCTGATATTA TTTTCGTGGC GCAACCTTCC GATGGGCAAA AAAAACGCGC TGAGAAATCG GGCGTCACTT TGCTGTACAC GCCATTTGCC CGTGAAGCGT TTGTTTTCAT CGTCAATGCG GATAATCCGG TTAATTCCCT GACTGAACAA CAAGTGCGTG ACATTTTTAG TGGCGCAATT ACCAACTGGC GCACGGTTGG CGGTAACGAT CAGGAGATCC AGACCTGGCA ACGCCCGGAA GACTCTGGCA GCCAGACAGT GATGCAATCG CAGGTCATGA AAAATGTCCG CATGATCTCG CCGCAGGAAA CCGAAGTGGC AAGCATGATG GAGGGAATGA TTAAAGTCGT TGCCGAATAC CGTAATACAA ACAACGCAAT AGGCTACACC TTCCGCTATT ACGCAACACA AATGAATGCC GATAAAAATA TAAAACTGCT AGCGATTAAC GGTATTGCAC CGACTGCGGA AAATATTCGT AACGGCAAAT ATCCGTATAT CATCGATGCC TTTATGGTAA CGCGGGAAAA TACTACGTCA GAAACACAAA AACTGGTCGA ATGGTTTTTA ACGCCGCAGG GACAGAGTCT GGTGGAAGAT GTGGGCTATG TGCCGCTGTA TCCAACAATG AAATAA
|
Protein sequence | MQNGKWILTS LVMTFFGIPI LAQFLAAVVA MLGAGLAAIF DVCNLLFTPT IYLLLNVFML TLGALLLFFS GRVWAGDSAP ENREIAAWRQ CLFLVPALLT LVGWIITLHL ADYQFRQMVS GWLANLMLPW LGVFTVSFVG GEYWWIVIIP VGAHISFSLG YGWPTRHPLT GTSGLRCRNL LLFILLLLGI VAGYQAYLYK QLNPGVGVRE NIDTWAWRPD KLNNQLTPLR GKPQIQFTQN WPRLDGATAA YPIYASAFYA LSVIPEDFHV WDYLDNSRTQ EAYNKIVKGD ADIIFVAQPS DGQKKRAEKS GVTLLYTPFA REAFVFIVNA DNPVNSLTEQ QVRDIFSGAI TNWRTVGGND QEIQTWQRPE DSGSQTVMQS QVMKNVRMIS PQETEVASMM EGMIKVVAEY RNTNNAIGYT FRYYATQMNA DKNIKLLAIN GIAPTAENIR NGKYPYIIDA FMVTRENTTS ETQKLVEWFL TPQGQSLVED VGYVPLYPTM K
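The sequences above are printed in space-separated blocks of 10 characters. The following Python sketch shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence with the amino acid assignments of translation table 11. It is an illustration, not the method used by the database: the helper names are hypothetical, it assumes an uppercase A/C/G/T sequence already in coding orientation, and it does not handle alternative start codons.

```python
# Amino acids for NCBI translation table 11, with codons ordered T, C, A, G
# at each position (the same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and any line breaks, and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = clean_sequence("ATGCAAAACG GAAAATGGAT TTTGACCTCG")
print(translate(prefix))    # MQNGKWILTS
print(gc_fraction(prefix))  # 0.4
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through the same functions is one way to compare against the Protein sequence and GC content fields of the record.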