Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3758 |
Symbol | |
ID | 6145708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3823613 |
End bp | 3824662 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618584 |
Product | hypothetical protein |
Protein accession | YP_001745724 |
Protein GI | 170679689 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCC CTCAACCCGA TAAAACGGGC ATGCACATTC TGCTCAAGCT GGCCTCGCTG GTGGTGATCC TCGCGGGAAT TCACGCAGCG GCAGATATCA TTGTGCAGCT GTTACTGGCG CTGTTTTTTG CCATCGTCCT CAACCCGCTC GTCACCTGGT TTATTCGTCG GGGAGTACAA CGCCCCGTTG CCATTACGAT TGTGGTGGTG GTGATGCTGA TCGCACTAAC CGCGCTGGTC GGCGTACTGG CGGCATCGTT TAACGAATTT ATCTCTATGC TGCCGAAGTT TAATAAGGAG CTGACGCGCA AACTTTTTAA ATTGCAGGAG ATGTTGCCTT TTCTTAATTT GCATATGTCG CCGGAGCGAA TGCTGCAGCG GATGGACTCG GAAAAAATGG TTACCTTCAC CACGGCGCTA ATGACCGGGC TTTCCGGGGC AATGGCGAGC GTGCTTTTGC TGGTGATGAC CGTAGTTTTC ATGCTGTTTG AAGTGCGCCA CGTCCCTTAC AAAATGCGTT TTGCGTTGAA TAATCCACAG ATTCACATCG CCGGACTACA CCGCGCCTTA AAAGGTGTTT CGCACTATCT TGCATTGAAG ACACTGCTAA GTTTATGGAC AGGCGTAATC GTCTGGCTGG GGCTGGCGCT AATGGGCGTA CAGTTTGCGC TGATGTGGGC AGTACTGGCG TTTTTGCTCA ACTACGTGCC CAATATCGGC GCGGTAATTT CCGCCGTACC GCCAATGATT CAGGTGCTGC TGTTTAATGG CGTTTACGAA TGTATTCTGG TCGGCGCATT GTTTTTAGTG GTCCATATGG TCATCGGCAA TATTTTAGAA CCACGGATGA TGGGCCATCG CCTGGGGATG TCCACCATGG TGGTATTTCT TTCATTGTTA ATTTGGGGAT GGCTGCTCGG CCCGGTAGGG ATGCTACTTT CGGTACCATT AACCAGCGTG TGTAAAATCT GGATGGAAAC CACCAAAGGC GGTAGCAAAC TGGCGATTTT ACTGGGACCG GGCAGACCGA AAAGTCGATT ACCGGGATGA
|
Protein sequence | METPQPDKTG MHILLKLASL VVILAGIHAA ADIIVQLLLA LFFAIVLNPL VTWFIRRGVQ RPVAITIVVV VMLIALTALV GVLAASFNEF ISMLPKFNKE LTRKLFKLQE MLPFLNLHMS PERMLQRMDS EKMVTFTTAL MTGLSGAMAS VLLLVMTVVF MLFEVRHVPY KMRFALNNPQ IHIAGLHRAL KGVSHYLALK TLLSLWTGVI VWLGLALMGV QFALMWAVLA FLLNYVPNIG AVISAVPPMI QVLLFNGVYE CILVGALFLV VHMVIGNILE PRMMGHRLGM STMVVFLSLL IWGWLLGPVG MLLSVPLTSV CKIWMETTKG GSKLAILLGP GRPKSRLPG
|
| |