Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2660 |
Symbol | |
ID | 6144501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2728135 |
End bp | 2729127 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617531 |
Product | putative adhesin |
Protein accession | YP_001744696 |
Protein GI | 170680922 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.468628 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTC ATATGAAACG TGGCCTGACA AGAGTCGCGC TGGCACTGAT GCTGGCTGGA TACTGTGCCG TGCCAGCCGC TATGGCGGAA GATGCCGCCT GGGTAGCCAG TGGGACGACC GCTGAATTCG AAGGGACCAT ACCCTGGCTT TATCGCGAGG GTGGGAATGC CACGATTAAC TCGGATGATG CGGATCACAT CAAAGTAACG TCTGATGGTA AAGGTACTCG TCCTTCAGGT AGCGAGACGG ATAAACGCCT TTACTCAGGA GATACCATCA CGTTGGGGTG GGATATCGGC GATACCGAAG GGGATATCGA CGATGGTCCA GACGGTATTG ATGCGAAAAC GACAGCCACT ATCAAATGGT ACAGTTACAG CGATAATGCG GGTGGCGGTA AAACGGAGCT GACCGCAGCG GCAGGCAAGA CGAGTTATAA ACTCACTGAT GACGAACGTG GGCGTTATAT CGGTGTTGAA ATCCAGCCCA TCACCCAGAC CGGTAATCCT TTCCAGGGAA CATCGCTGAC TCTGCTGGAT ATTTCAACGG CCAGCGGTGG CGGCAGCGAT ACGGATAATG TTGATCCTGG CCCGGTTGTG AACCAGAACC TGAAAGTTGC CATCTTCGAG AAAGGCACCA GTACCAACCT TATCGGTGGT AATACAGCCA TTGCGCTTAA CAAAACGTAT GTGGCCAAAC TGTACTCGGA TGAAAACCAG AATGGTAAGT ACGATGCGGG TACGGATGTG GACGTTACCG CTAACTACGA CTTTGCGTGG GTCTTTAACG GTAACAGTAA ACAACTTGCG GCGGCGGGCG GTATTGCCAA CGCCAGCTTC GATAACAATG ACATTGTTAT TCCGCAAACT AACGAACAAG CAAGAACCAG CCTTAACGGT AGTGACCGTA ATGGTAAGAC CGGCCTTGCA ATCCCGGCAA ACGGCGACGG CGTTCAGGGC TATACGCTGT CTATCATTTA CAAACACCAC TAA
|
Protein sequence | MKPHMKRGLT RVALALMLAG YCAVPAAMAE DAAWVASGTT AEFEGTIPWL YREGGNATIN SDDADHIKVT SDGKGTRPSG SETDKRLYSG DTITLGWDIG DTEGDIDDGP DGIDAKTTAT IKWYSYSDNA GGGKTELTAA AGKTSYKLTD DERGRYIGVE IQPITQTGNP FQGTSLTLLD ISTASGGGSD TDNVDPGPVV NQNLKVAIFE KGTSTNLIGG NTAIALNKTY VAKLYSDENQ NGKYDAGTDV DVTANYDFAW VFNGNSKQLA AAGGIANASF DNNDIVIPQT NEQARTSLNG SDRNGKTGLA IPANGDGVQG YTLSIIYKHH
|
| |