Gene EcSMS35_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1471 
Symbol 
ID6146032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1453897 
End bp1455795 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content39% 
IMG OID641616349 
Productankyrin repeat-containing protein 
Protein accessionYP_001743529 
Protein GI170683130 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.103946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAA ACGACATTAT TATCAGAACT CATTATAAGT CTCCTCATAG AATGCACATC 
GATAGCGACA TACCAACGCC TTCATCAGAG CCTATTAATC AATTTGCGCG CCAACTCATC
ACCCTACTTG ATACCTCTGA CTTAAGTTCG ATGCTGACAT ACTGTGTTAC TCAGGAATTT
ACCGCAAACT GTCGAAAAAT ATCACAAAAT TGTTATTCCA CTGCCCTTTT TACCATTAAC
TTTGCCACTT CACCCATCCA TGCAGAAAAT ATACTCATTA CATTACACTA TAAAAAAGAA
ATCATTTCCT TATTACTGGA AACCACGCCT ATTAAAGCTA ACCATTTGCG AAGCATACTG
GATTATATTG AACAGGAACA GTTAACTGCC GAAAATCGTA ACCATTGTAT GAAACTGTCT
AAAAAAATCC ATAGAGAAAA AACTATACAA CCAACAGTAA ATCTCAATGG TAGTGCATTT
TTTTTGCAAT CTCCTTCTGA CGCTATTTTT TGTCGCCATC TGTCATTGCA ATACGCCCTT
GATTCACTGA GAAATGGAAA AGGCAAAGTC AATCTGATTA AACATTACTC CTCCGTTGAA
TCCATACAGC AGCATGTTCC CTTAGTCCGG GACGCGGAGT TCAGATCATT ACTTCGCCAT
CCTCCTGCAG GGAGTCGCGT TATCGCGAGT AAGGATTTTG GCTTCGCTTT AGATATTTTC
TTCTGTCGAA TGATGGCAAA CAATGTCAGT CATATGTCCG CGATTTTATA TATAGACAAT
CATACTTTGT CAGTAAGGCT ACGAATAAAG CAGTCAGCGT ATGGGCAATT AAATTATGTT
GTGTCCGTTT ACGACCCGAA CGATACCAAC GTTGCCGTCA GAGGCACCCA CAGGACAGCA
CGGGGCTTTC TCTCGCTCGA TAAATTCATC AGTTCAGGTC CCGATGCTCA GACCTGGGCT
GATATGTATG TTCGCAACTG TGCAATTGCT TTTCTGCCCC TATTACCTGA GGGAGTTCCA
GGGGCTATTT TCGCGGGTAT TGCATCACGA ATGCCATTTG CCCCTATACA TCCATCGGCA
ATGTTGTTAA TAATGGCCAC AGGCCAGACT CAACAGCTTA TTACATTATT CAAACAGTTA
CCCATACTCC CTGAAAAAGA AATCAATGAA ATAATAACTG CGCAGAATAG CGTTGGTACA
CCTGCTTTAT TTCTGGCTAT GATGAACGGA CATACTGACA ACGTAAAAAT ATTTATGCAA
GAAATTCAGT CACTGGTAGA TAATCATATC ATTCATGAAG ATAATCTGGT TAAATTACTG
CAAACTAAAA GTGCTAACGA AACACCTGGA CTCTATATCT CCATGTTGTA TGGATTCGAT
GAAATAATCG ATATCTTTCT GAATGCATTA ACCACTCCAA TAGCACAAGA TCTTTTAAAC
AAAAAAATGG TAATGAATAT TTTAGCAATG AAAACACGTG ATGGTGAGCC AGGATTATAT
GCCGCAATGG AAAATAATCA CCCTTTGTGT GTCACACGGT TCCTCTCTAA AGTTTATGGA
ATCGCTGTTA AATATAACCT CAGCAAAATT AACATCATGG ATTTATTAAA AGGCGCAACA
GCACATGGAA CCCCTGCTTT ATACATCGCC ATGAGCAAGG GTAATAAAGA CGTCGTGTTA
TCTTATATAT CAACGCTGGG TACTTTTGCA AAAAAATATT CTTTTAGTCA ATGTCAGTTA
TTCACATTGT TGGCCGCTAA AAATCATGAC AACATGTCAG CTGTTCATAT AGCCATTCAT
CATAATCATT ATAAAACTGT AGAAACATAT TATGCAGCTA TAAATGTAAT CAGCCAAAGC
CTGAGCTTTA GTCCAGATGA ACTACAGGCG TATTTATAA
 
Protein sequence
MSQNDIIIRT HYKSPHRMHI DSDIPTPSSE PINQFARQLI TLLDTSDLSS MLTYCVTQEF 
TANCRKISQN CYSTALFTIN FATSPIHAEN ILITLHYKKE IISLLLETTP IKANHLRSIL
DYIEQEQLTA ENRNHCMKLS KKIHREKTIQ PTVNLNGSAF FLQSPSDAIF CRHLSLQYAL
DSLRNGKGKV NLIKHYSSVE SIQQHVPLVR DAEFRSLLRH PPAGSRVIAS KDFGFALDIF
FCRMMANNVS HMSAILYIDN HTLSVRLRIK QSAYGQLNYV VSVYDPNDTN VAVRGTHRTA
RGFLSLDKFI SSGPDAQTWA DMYVRNCAIA FLPLLPEGVP GAIFAGIASR MPFAPIHPSA
MLLIMATGQT QQLITLFKQL PILPEKEINE IITAQNSVGT PALFLAMMNG HTDNVKIFMQ
EIQSLVDNHI IHEDNLVKLL QTKSANETPG LYISMLYGFD EIIDIFLNAL TTPIAQDLLN
KKMVMNILAM KTRDGEPGLY AAMENNHPLC VTRFLSKVYG IAVKYNLSKI NIMDLLKGAT
AHGTPALYIA MSKGNKDVVL SYISTLGTFA KKYSFSQCQL FTLLAAKNHD NMSAVHIAIH
HNHYKTVETY YAAINVISQS LSFSPDELQA YL