Gene EcSMS35_0317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0317 
Symbol 
ID6146558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp323181 
End bp325706 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content55% 
IMG OID641615213 
Producthypothetical protein 
Protein accessionYP_001742421 
Protein GI170682860 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0205782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00830158 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTTTAC GACGGTTCTC CCCAGGACTG AAAGCCCAGT TTGCCTTCGG CATGGTCTTT 
TTGTTCGTTC AGCCCGATGC CAGCGCTGCT GACATAAGTG CGCAGCAAAT AGGTGGGGTG
ATTATTCCGC AGGCCTTCAG TCAGGCGCTC CAGGACGGCA TGAGCGTCCC GCTTTATATT
CATCTCGCCG GTAGCCAGGG TCGCCAGGAC GATCAGCGAA TCGGCAGCGC TTTTATCTGG
TTGGATGATG GACAGCTACG CATCCGGAAA ATACAGCTGG AAGAGAGTGA AGATAACGCC
AGTGTCAGCG AACAAACTCG ACAGCAGCTG ATGACTCTGG CCAACGCCCC GTTCAATGAG
GCCCTTACCA TCCCCCTGAC TGACAACGCG CAGTTGGATC TCAGCTTGCG CCAACTGCTG
CTGCAGCTGG TGGTCAAGCG CGAAGCGCTG GGCACCGTAC TACGCTCACG TAGCGAAGAC
ATCGGGCAGT CCAGTGTTAA CACCCTCAGC AGTAATCTGA GCTATAACTT CGGCGTCTAT
AACAACCAGT TGCGTAACGG CGGGAGCAAC ACATCCAGCT ATCTGTCGCT GAATAACGTT
ACTGCACTGC GCGAACATCA TGTGGTGCTC GACGGCTCGC TGTACGGGAT CGGTAGCGGT
CAACAGGACA GTGAATTATA TAAAGCGATG TATGAACGCG ATTTTGCCGG TCACCGATTT
GCCGGTGGAA TGCTCGACAC CTGGAACTTG CAGTCCTTAG GGCCGATGAC CGCCATTTCA
GCAGGGAAGA TTTACGGCCT TTCCTGGGGA AACCAGGCCA GCTCCACCAT CTTCGACAGC
AGCCAGTCAG CCACGCCAGT GATCGCCTTT TTACCGGCGG CGGGCGAAGT ACATCTCACC
CGTGATGGGC GGTTACTAAG CGTTCAGAAC TTCACCATGG GCAATCATGA AGTGGATACC
CGGGGTCTAC CGTACGGTAT TTACGATGTG GAAGTTGAGG TCATCGTTAA CGGTCGCGTG
ATTAGCAAAC GCACTCAACG GGTCAATAAG CTGTTTAGCC GGGGGCGCGG CGTCGGTGCA
CCACTGGCGT GGCAGATATG GGGCGGTAGC TTTCATATGG ATCGCTGGTC GGAAAACGGG
AAAAAGACGC GACCAGCTAA AGAGAGTTGG CTGGCAGGTG CCTCGACCTC CGGCTCACTG
AGTACGCTTA GCTGGGCGGC AACGGGATAT GGATACGATA ATCAGGCGGT GGGTGAAACC
CGTCTGACGC TGCCGCTTGG GGGGGCGATC AACGTTAACC TGCAAAACAT GCTGGCCAGT
GACAGCTCAT GGAGCAACAT CGCCAGCATC AGCGCCACTC TACCTGGAGG CTTCAGTTCG
CTGTGGGTTA ACCAGGAAAA AACCCGCATT GGCAATCAAT TGCGACGTAG CGATGCCGAC
AACCGTGCAA TCGGCGGCAC ACTCAACCTG AACTCACTGT GGTCGAAGCT GGGTACGTTC
AGCATCAGCT ACAATGATGA CCGCCGCTAC AACAGCCATT ATTACACGGC AGATTACTAT
CAAAATGTCT ACAGCGGTAC CTTTGGTTCG CTTGGCCTGC GGGCCGGTAT TCAGCGCTAT
AACAACGGCG ACAGCAGCGC CAATACAGGG AAATATATCG CTCTCGATCT CTCGCTACCA
CTGGGCAACT GGTTTAGCGC AGGGATGACC CATCAAAACG GCTACACCAT GGCAAACCTG
TCAGCACGCA AACAGTTTGA TGAAGGAACC ATTCGCACTG TTGGTGCCAA TCTGTCACGA
GCCATCTCCG GCGATACCGG TGATGACAAA ACTCTCAGTG GTGGGGCGTA TGCACAGTTC
GACGCTCGTT ACGCCAGCGG AACGCTGAAC GTCAATAGCG CGGCGGACGG CTACATCAAT
ACTAATTTGA CCGCCAACGG CAGCGTCGGC TGGCAGGGTA AAAACATTGC TGCCAGCGGG
CGGACTGATG GCAACGCTGG GGTGATATTC AACACCGGGC TGGAGGACGA CGGTCAGATC
AGCGCCAAAA TCAACGGACG GATTTTCCCG CTTAACGGCA AGCGTAACTA TCTCCCGCTC
TCTCCCTATG GAAGATATGA GGTGGAGTTA CAGAACAGCA AAAACTCACT CGACAGTTAC
GATATCGTCA GCGGTCGCAA AAGTCATCTG ACTCTCTATC CAGGCAATGT CGCTGTCATT
GAGCCAGAGG TGAAGCAGAT GGTTACCGTC TCCGGTCGTA TCCGTGCGGA AGACGGCACA
CTGCTGGCTA ACGCACGGAT TAACAACCAT ATCGGCCGAA CCCGAACCGA TGAAAACGGC
GAGTTTGTCA TGGACGTGGA TAAGAAATAC CCCACTATCG ATTTTCGCTA CAGTGGCAAT
AAAACCTGCG AAGTGGCACT GGAACTCAAT CAGGCGCGCG GTGCAGTCTG GGTCGGTGAT
GTGGTCTGCA GCGGCCTCTC ATCGTGGGCG GCGGTGACGC AGACAGGAGA AGAGAATGAG
AGTTAA
 
Protein sequence
MPLRRFSPGL KAQFAFGMVF LFVQPDASAA DISAQQIGGV IIPQAFSQAL QDGMSVPLYI 
HLAGSQGRQD DQRIGSAFIW LDDGQLRIRK IQLEESEDNA SVSEQTRQQL MTLANAPFNE
ALTIPLTDNA QLDLSLRQLL LQLVVKREAL GTVLRSRSED IGQSSVNTLS SNLSYNFGVY
NNQLRNGGSN TSSYLSLNNV TALREHHVVL DGSLYGIGSG QQDSELYKAM YERDFAGHRF
AGGMLDTWNL QSLGPMTAIS AGKIYGLSWG NQASSTIFDS SQSATPVIAF LPAAGEVHLT
RDGRLLSVQN FTMGNHEVDT RGLPYGIYDV EVEVIVNGRV ISKRTQRVNK LFSRGRGVGA
PLAWQIWGGS FHMDRWSENG KKTRPAKESW LAGASTSGSL STLSWAATGY GYDNQAVGET
RLTLPLGGAI NVNLQNMLAS DSSWSNIASI SATLPGGFSS LWVNQEKTRI GNQLRRSDAD
NRAIGGTLNL NSLWSKLGTF SISYNDDRRY NSHYYTADYY QNVYSGTFGS LGLRAGIQRY
NNGDSSANTG KYIALDLSLP LGNWFSAGMT HQNGYTMANL SARKQFDEGT IRTVGANLSR
AISGDTGDDK TLSGGAYAQF DARYASGTLN VNSAADGYIN TNLTANGSVG WQGKNIAASG
RTDGNAGVIF NTGLEDDGQI SAKINGRIFP LNGKRNYLPL SPYGRYEVEL QNSKNSLDSY
DIVSGRKSHL TLYPGNVAVI EPEVKQMVTV SGRIRAEDGT LLANARINNH IGRTRTDENG
EFVMDVDKKY PTIDFRYSGN KTCEVALELN QARGAVWVGD VVCSGLSSWA AVTQTGEENE
S