Gene EcSMS35_2302 details


Gene Information

Locus tag: EcSMS35_2302
Symbol: cirA
ID: 6147114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2330491
End bp: 2332416
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 53%
IMG OID: 641617176
Product: colicin I receptor
Protein accession: YP_001744349
Protein GI: 170682371
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID:
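
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive, which the record does not state but which is consistent with the listed gene length.

```python
# Values copied from the Gene Information record above.
start_bp = 2330491
end_bp = 2332416
gene_length_bp = 1926
protein_length_aa = 641

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in whole codons; the final codon is a stop and is not
# counted in the protein length.
codons = span // 3

print(span == gene_length_bp)           # span matches Gene Length
print(span % 3 == 0)                    # span is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop matches Protein Length
```

Under that assumption, all three checks agree with the values listed above.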


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.010442
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTAGCGG TCGATGATGA TGGCGAAACG ATGGTTGTCA CTGCATCTTC CGTTGAGCAA 
AACCTTAAAG ATGCACCCGC CAGTATCAGC GTCATTACCC AGGAAGACCT GCAGCGAAAA
CCGGTACAGA ATCTGAAGGA TGTCCTCAAA GAAGTGCCTG GCGTACAACT GACGAACGAA
GGGGATAACC GTAAGGGCGT AAGTATTCGT GGTCTGGACA GCAGCTACAC CCTGATTCTT
GTCGACGGTA AACGCGTTAA CTCCCGCAAT GCCGTCTTCC GCCACAATGA TTTCGATCTG
AACTGGATCC CGGTCGATTC CATCGAACGT ATTGAAGTGG TCCGTGGCCC GATGTCGTCG
CTGTACGGTT CCGATGCGCT CGGCGGTGTA GTGAATATCA TCACCAAAAA AATCGGTCAG
AAATGGTCGG GCACCGTTAC CGTCGATACC ACCGTTCAGG AACATCGCGA TCGCGGTGAT
ACCTATAACG GTCAGTTCTT TACCAGCGGA CCATTAATTG ACGGCGTGCT GGGAATGAAA
GCTTACGGCA GCCTGGCAAA ACGTGAAAAG GATGACCCGC AAAACTCAAC GACCACCGAT
ACCGGAGAAA CGCCGCGTAT TGAAGGATTC TCCAGCCGCG ACGGCAATGT CGAATTTGCC
TGGACACCGA ATCAAAATCA CGATTTTACT GCCGGATACG GTTTCGACCG TCAGGATCGT
GATTCCGACT CGCTGGACAA AAACCGCCTG GAACGCCAGA ACTACTCCGT CAGCCATAAT
GGGCGTTGGG ATTACGGCAC CAGCGAACTG AAATACTACG GTGAGAAAGT CGAGAACAAA
AACCCTGGCT ACAGCAGCCC GATAACTTCC GAAAGCAATA CGGTCGACGG CAAATACACG
TTGCCGCTGA CGGCGATTAA TCAGTTTCTC ACGGTTGGCG GTGAATGGCG TCACGACAAA
CTTAGCGATG CGGTGAACCT GACCGGGGGA ACCAGCTCCA AAACGTCTGC CAGCCAGTAC
GCGCTGTTTG TGGAAGATGA ATGGCGGATC TTCGAGCCGC TGGCGCTGAC GACCGGCGTA
CGTATGGACG ATCATGAAAC CTACGGTGAA CACTGGAGTC CGCGTGCCTA CCTGGTTTAT
AACGCCACCG ACACCGTAAC GGTGAAAGGG GGCTGGGCGA CGGCATTTAA AGCGCCTTCT
CTGTTGCAAC TTAGCCCTGA CTGGACGAGC AATTCCTGCC GTGGCGCATG TAAGATTGTG
GGTAGCCCGG ATCTGAAACC AGAAACCAGC GAAAGTTGGG AACTGGGGCT TTACTACATG
GGCGAAGAAG GCTGGCTGGA AGGGGTTGAA TCCAGCGTTA CCGTTTTCCG TAACGATGTG
AAAGATCGTA TCAGCATTAG CCGTACGTCT GACGTCAACG CTGCACCGGG CTACCAAAAC
TTTGTCGGTT TTGAGACGGG CGCTAACGGA CGGCGCATAC CGGTATTTAG CTACTACAAC
GTTAACAAAG CTCGTATTCA GGGCGTGGAA ACCGAACTGA AAATTCCGTT CAACGATGAA
TGGAAACTGT CGATCAACTA CACCTACAAC GATGGTCGTG ATGTCAGCAA CGGCGAAAAC
AAACCGCTAT CCGATCTGCC GTTCCATACT GCTAACGGTA CGCTGGACTG GAAACCGCTG
GCGCTGGAAG ACTGGTCATT CTATGTTTCT GGTCACTATA CCGGGCAGAA ACGCGCCGAC
AGCGCGACGG CTAAAACACC GGGCGGTTAT ACCATCTGGA ATACCGGTGC GGCCTGGCAG
GTGACTAAAG ACGTCAAACT GCGCGCAGGC GTGCTGAACC TTGGCGACAA GGATCTCAGT
CGTGACGACT ACAGCTATAA CGAAGACGGA CGTCGTTACT TTATGGCAGT GGATTATCGC
TTCTGA
 
Protein sequence
MLAVDDDGET MVVTASSVEQ NLKDAPASIS VITQEDLQRK PVQNLKDVLK EVPGVQLTNE 
GDNRKGVSIR GLDSSYTLIL VDGKRVNSRN AVFRHNDFDL NWIPVDSIER IEVVRGPMSS
LYGSDALGGV VNIITKKIGQ KWSGTVTVDT TVQEHRDRGD TYNGQFFTSG PLIDGVLGMK
AYGSLAKREK DDPQNSTTTD TGETPRIEGF SSRDGNVEFA WTPNQNHDFT AGYGFDRQDR
DSDSLDKNRL ERQNYSVSHN GRWDYGTSEL KYYGEKVENK NPGYSSPITS ESNTVDGKYT
LPLTAINQFL TVGGEWRHDK LSDAVNLTGG TSSKTSASQY ALFVEDEWRI FEPLALTTGV
RMDDHETYGE HWSPRAYLVY NATDTVTVKG GWATAFKAPS LLQLSPDWTS NSCRGACKIV
GSPDLKPETS ESWELGLYYM GEEGWLEGVE SSVTVFRNDV KDRISISRTS DVNAAPGYQN
FVGFETGANG RRIPVFSYYN VNKARIQGVE TELKIPFNDE WKLSINYTYN DGRDVSNGEN
KPLSDLPFHT ANGTLDWKPL ALEDWSFYVS GHYTGQKRAD SATAKTPGGY TIWNTGAAWQ
VTKDVKLRAG VLNLGDKDLS RDDYSYNEDG RRYFMAVDYR F
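
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The function name `translate_cds` and the constants are hypothetical, the codon table is built from the standard NCBI codon ordering, and the start codon set is the one listed for table 11.

```python
from itertools import product

# Standard NCBI codon ordering (bases T, C, A, G); table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is translated as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above, with its spacing left in place.
first_line = "GTGTTAGCGG TCGATGATGA TGGCGAAACG ATGGTTGTCA CTGCATCTTC CGTTGAGCAA"
print(translate_cds(first_line))  # MLAVDDDGETMVVTASSVEQ
```

The printed residues match the first two blocks of the protein sequence listed above.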