Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1471 |
Symbol | |
ID | 6146032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1453897 |
End bp | 1455795 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641616349 |
Product | ankyrin repeat-containing protein |
Protein accession | YP_001743529 |
Protein GI | 170683130 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.103946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAAA ACGACATTAT TATCAGAACT CATTATAAGT CTCCTCATAG AATGCACATC GATAGCGACA TACCAACGCC TTCATCAGAG CCTATTAATC AATTTGCGCG CCAACTCATC ACCCTACTTG ATACCTCTGA CTTAAGTTCG ATGCTGACAT ACTGTGTTAC TCAGGAATTT ACCGCAAACT GTCGAAAAAT ATCACAAAAT TGTTATTCCA CTGCCCTTTT TACCATTAAC TTTGCCACTT CACCCATCCA TGCAGAAAAT ATACTCATTA CATTACACTA TAAAAAAGAA ATCATTTCCT TATTACTGGA AACCACGCCT ATTAAAGCTA ACCATTTGCG AAGCATACTG GATTATATTG AACAGGAACA GTTAACTGCC GAAAATCGTA ACCATTGTAT GAAACTGTCT AAAAAAATCC ATAGAGAAAA AACTATACAA CCAACAGTAA ATCTCAATGG TAGTGCATTT TTTTTGCAAT CTCCTTCTGA CGCTATTTTT TGTCGCCATC TGTCATTGCA ATACGCCCTT GATTCACTGA GAAATGGAAA AGGCAAAGTC AATCTGATTA AACATTACTC CTCCGTTGAA TCCATACAGC AGCATGTTCC CTTAGTCCGG GACGCGGAGT TCAGATCATT ACTTCGCCAT CCTCCTGCAG GGAGTCGCGT TATCGCGAGT AAGGATTTTG GCTTCGCTTT AGATATTTTC TTCTGTCGAA TGATGGCAAA CAATGTCAGT CATATGTCCG CGATTTTATA TATAGACAAT CATACTTTGT CAGTAAGGCT ACGAATAAAG CAGTCAGCGT ATGGGCAATT AAATTATGTT GTGTCCGTTT ACGACCCGAA CGATACCAAC GTTGCCGTCA GAGGCACCCA CAGGACAGCA CGGGGCTTTC TCTCGCTCGA TAAATTCATC AGTTCAGGTC CCGATGCTCA GACCTGGGCT GATATGTATG TTCGCAACTG TGCAATTGCT TTTCTGCCCC TATTACCTGA GGGAGTTCCA GGGGCTATTT TCGCGGGTAT TGCATCACGA ATGCCATTTG CCCCTATACA TCCATCGGCA ATGTTGTTAA TAATGGCCAC AGGCCAGACT CAACAGCTTA TTACATTATT CAAACAGTTA CCCATACTCC CTGAAAAAGA AATCAATGAA ATAATAACTG CGCAGAATAG CGTTGGTACA CCTGCTTTAT TTCTGGCTAT GATGAACGGA CATACTGACA ACGTAAAAAT ATTTATGCAA GAAATTCAGT CACTGGTAGA TAATCATATC ATTCATGAAG ATAATCTGGT TAAATTACTG CAAACTAAAA GTGCTAACGA AACACCTGGA CTCTATATCT CCATGTTGTA TGGATTCGAT GAAATAATCG ATATCTTTCT GAATGCATTA ACCACTCCAA TAGCACAAGA TCTTTTAAAC AAAAAAATGG TAATGAATAT TTTAGCAATG AAAACACGTG ATGGTGAGCC AGGATTATAT GCCGCAATGG AAAATAATCA CCCTTTGTGT GTCACACGGT TCCTCTCTAA AGTTTATGGA ATCGCTGTTA AATATAACCT CAGCAAAATT AACATCATGG ATTTATTAAA AGGCGCAACA GCACATGGAA CCCCTGCTTT ATACATCGCC ATGAGCAAGG GTAATAAAGA CGTCGTGTTA TCTTATATAT CAACGCTGGG TACTTTTGCA AAAAAATATT CTTTTAGTCA ATGTCAGTTA TTCACATTGT TGGCCGCTAA AAATCATGAC AACATGTCAG CTGTTCATAT AGCCATTCAT CATAATCATT ATAAAACTGT AGAAACATAT TATGCAGCTA TAAATGTAAT CAGCCAAAGC CTGAGCTTTA GTCCAGATGA ACTACAGGCG TATTTATAA
|
Protein sequence | MSQNDIIIRT HYKSPHRMHI DSDIPTPSSE PINQFARQLI TLLDTSDLSS MLTYCVTQEF TANCRKISQN CYSTALFTIN FATSPIHAEN ILITLHYKKE IISLLLETTP IKANHLRSIL DYIEQEQLTA ENRNHCMKLS KKIHREKTIQ PTVNLNGSAF FLQSPSDAIF CRHLSLQYAL DSLRNGKGKV NLIKHYSSVE SIQQHVPLVR DAEFRSLLRH PPAGSRVIAS KDFGFALDIF FCRMMANNVS HMSAILYIDN HTLSVRLRIK QSAYGQLNYV VSVYDPNDTN VAVRGTHRTA RGFLSLDKFI SSGPDAQTWA DMYVRNCAIA FLPLLPEGVP GAIFAGIASR MPFAPIHPSA MLLIMATGQT QQLITLFKQL PILPEKEINE IITAQNSVGT PALFLAMMNG HTDNVKIFMQ EIQSLVDNHI IHEDNLVKLL QTKSANETPG LYISMLYGFD EIIDIFLNAL TTPIAQDLLN KKMVMNILAM KTRDGEPGLY AAMENNHPLC VTRFLSKVYG IAVKYNLSKI NIMDLLKGAT AHGTPALYIA MSKGNKDVVL SYISTLGTFA KKYSFSQCQL FTLLAAKNHD NMSAVHIAIH HNHYKTVETY YAAINVISQS LSFSPDELQA YL
|
| |