Gene EcSMS35_1980 details


Gene Information

Locus tag: EcSMS35_1980
Symbol:
ID: 6144394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1999462
End bp: 2001783
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 47%
IMG OID: 641616856
Product: autotransporter (AT) family porin
Protein accession: YP_001744032
Protein GI: 170683308
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
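
The start, end, gene length, and protein length reported above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS is unspliced (the record says it is not spliced), and the function names are hypothetical rather than part of any database tool.

```python
# Values copied from the Gene Information record.
START_BP = 1999462
END_BP = 2001783
GENE_LENGTH_BP = 2322
PROTEIN_LENGTH_AA = 773


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the span length agrees with the listed gene length, and the codon count minus the stop codon agrees with the listed protein length.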


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0125961
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAGGGTA ATAATACGAT TGTCACAACG GGGGATTACT CAATTGGTTT GCTAAGTCAA 
ACGAATGGAA ACCTGAATAC TGATACCATA ATAAGAGTCA ACACTGACGG TTCGGTTAAC
CCCCATTTAT CTGACGAAGG TAATACTTTT ATTGTTACTG CGGGTAATCA CGCAGTCGGC
GTTCTTGCTT GTGCATCTCC CGGAAGCGCG CGTGCGTGTG TATCTTCTCT TGATGAAGAA
AGTACTGCCG ATACAGGAAG TAATGAAAAT AACGCCAAAG CAAAACTGGA TATGGCAAAA
GGGGAGATAA CAACCCACGG AACAGAAAGT TATGCCGCTT ACGCTAATGG CACTGTCGTC
AAGGCGGGTA GCAAGCTTGA TTATACCAAT GCCAGCGTTA CATTAACCGA TGTTGATATT
ACCACGCATG GGGACAATGC TCATGCCATC GCTGCTCGCC AGGGGACTGT TTCATTTAAC
CAGGGAGAAA TTCATACAAC GGGTCCTGAC GCCGCGATTG CTAAAATTTA TAATGGCGGC
AAGGTGACGC TGAAAAATAC ATCTGCAGTC GCGCATCAGG GGGCTGGAAT TGTGCTGGAG
TCTTCCATAA ATGGTCAGGA AGCAACGGTA GATATTTTAT CAGGTAGCTC ACTACGGTCA
GCAAATGAAA TCCTCTACAA TAAAAATGAG ACGAGTAACG TGACCATTAC GGATAGTGAG
GTATCATCGG CTGCAGATGT TTTTATTAAT AATATTAAAG GTCATTTGGT CATCGATGCA
TCTAATTCAA AAATAACGGG TTCAGCCAAT CTTTCAACAG ATGATAGCAC TCATACCTAT
CTGTCGCTTT CAGATAATAG TACCTGGGAT ATTAAAACTG ACTCAACGGT GAGCAAGCTC
ACGGTTGACA ATAGTACGGT CTATATTTCC CGGGCAGATG GAAAGGCTTT CGAACCAACA
AGATTAACAA TAACTGAAAA TTACGTTGGT AATAATGGCG TATTGCATCT CAGAACTGAA
TTGGGTGATG ATAATTCAGC TACGGATAAA GTCGTTATTA ATGGAAATAC CTCTGGAACA
ACCCGAGTTA AAGTTACCAA TGCTGGCGGC AGCGGGGCTT ACACGTTAAA TGGGATAGAG
ATTATCAGCG TTGAGGGGGA ATCAAATGGA GAATTTATTA AGGATTCGAG GATTTTCGCC
GGTGCCTACG AATATTCATT AACCCGAGGT AATACCGAAG CGACCAATAA AAACTGGTAT
CTGACTAACT TCCAGGCAAC GAGCGGCGGT GAAACAAACT CCGGAGGAAG TTCAGCGCCT
ACTGTTGCGC CTACCCCCGT CCTGCGCCCC GAAGCTGGAA GTTACGTCGC CAACCTGGCA
GCCGCTAACA CTCTTTTTGT TATGCGTCTG AACGATCGTG CAGGTGAAAC GCGCTACATC
GATCCTGTAA CTGAACAGGA GCGTTCAAGC CGACTTTGGC TACGTCAAAT TGGCGGGCAT
AATGCCTGGC GTGACAGCAA CGGACAGTTG AGAACGACCT CGCATCGCTA CGTCTCGCAG
TTAGGGGCCG ATCTGTTAAC CGGTGGTTTT ACCGACAGTG ACAGTTGGCG TTTGGGAGTG
ATGGCTGGTT ATGCCCGCGA CTACAACTCA ACTCATTCCA GCGTGTCGGA TTATCGTTCG
AAAGGGAGTG TCAGAGGCTA TAGCGCAGGG CTGTATGCCA CCTGGTTTGC CGATGACATC
AGTAAAAAAG GCGCATACAT TGACGCCTGG GCGCAATATA GCTGGTTTAA AAACTCGGTG
AAAGGGGATG AGTTAGCCTA TGAATCCTAT AGCGCGAAAG GCGCAACCGT CTCGCTGGAA
GCGGGTTACG GCTTTGCCCT GAATAAATCC TTTGGTCTGG AAGCGGCGAA ATATACGTGG
ATCTTCCAGC CACAGGCACA GGCTATCTGG ATGGGCGTCG ATCATAATGC GCACACGGAA
GCCAATGGCT CACGTATTGA GAATGACGCA AATAACAACT TCCAGACCCG ACTTGGTTTC
CGCACCTTTA TTCGTACTCA GGAGAAAAAC AGCGGTCCTC ACGGTGACGA CTTTGAACCT
TTTGTTGAAA TGAACTGGAT CCATAACAGT AAAGATTTTG CTGTCTCAAT GAATGGTGTG
AAAGTCGAAC AAGACGGGGC GCGTAATCTG GGGGAAATTA AACTTGGCGT AAATGGCAAT
CTGAATCCGT CGGCCAGCGT CTGGGGTAAT GTGGGCGTGC AGCTGGGTGA TAATGGCTAC
AATGACACCG CAATGATGGT GGGCCTGAAA TATAAGTTCT GA
 
Protein sequence
MQGNNTIVTT GDYSIGLLSQ TNGNLNTDTI IRVNTDGSVN PHLSDEGNTF IVTAGNHAVG 
VLACASPGSA RACVSSLDEE STADTGSNEN NAKAKLDMAK GEITTHGTES YAAYANGTVV
KAGSKLDYTN ASVTLTDVDI TTHGDNAHAI AARQGTVSFN QGEIHTTGPD AAIAKIYNGG
KVTLKNTSAV AHQGAGIVLE SSINGQEATV DILSGSSLRS ANEILYNKNE TSNVTITDSE
VSSAADVFIN NIKGHLVIDA SNSKITGSAN LSTDDSTHTY LSLSDNSTWD IKTDSTVSKL
TVDNSTVYIS RADGKAFEPT RLTITENYVG NNGVLHLRTE LGDDNSATDK VVINGNTSGT
TRVKVTNAGG SGAYTLNGIE IISVEGESNG EFIKDSRIFA GAYEYSLTRG NTEATNKNWY
LTNFQATSGG ETNSGGSSAP TVAPTPVLRP EAGSYVANLA AANTLFVMRL NDRAGETRYI
DPVTEQERSS RLWLRQIGGH NAWRDSNGQL RTTSHRYVSQ LGADLLTGGF TDSDSWRLGV
MAGYARDYNS THSSVSDYRS KGSVRGYSAG LYATWFADDI SKKGAYIDAW AQYSWFKNSV
KGDELAYESY SAKGATVSLE AGYGFALNKS FGLEAAKYTW IFQPQAQAIW MGVDHNAHTE
ANGSRIENDA NNNFQTRLGF RTFIRTQEKN SGPHGDDFEP FVEMNWIHNS KDFAVSMNGV
KVEQDGARNL GEIKLGVNGN LNPSASVWGN VGVQLGDNGY NDTAMMVGLK YKF
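
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG is one of the accepted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline: the helper names are hypothetical, and the whitespace stripping assumes the 10-base block layout used in this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, reading a valid first codon as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":            # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("TTGCAGGGTA ATAATACGAT TGTCACAACG"))
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content listed in the record.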