Gene EcSMS35_1678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1678 
Symbol 
ID6146423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1674518 
End bp1676890 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content46% 
IMG OID641616554 
ProductTonB-dependent receptor 
Protein accessionYP_001743732 
Protein GI170681386 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAG TTCTTATTCC TGGCGTCATT TTATGTGGCG CTGATGTAGC GCAGGCCGTC 
GATGACAAAA ATATGTACAT GCATGTTTTT GAAGAGATGA CGGTCTATGC TCCTGTCCCT
GTACCCGTAA ACGGCAACAC GCATTACACC AGTGAAAGCA TCGAGCGTTT ACCGACCGGG
AATGGCAATA TCAGCGATCT GCTGAGAACC AACCCTGCGG TACGCATGGA TTCAACGCAA
AGTACCTCAT TGAACCAGGG AGATATTCGC CCGGAGAAAA TCTCTATTCA CGGTGCGTCG
CCCTACCAGA ATGCCTATTT GATTGACGGT ATTAGTGCAA CGAATAACCT GAACCCAGCG
AATGAGTCCG ATGCCAGTAG TGCAACCAAT ATTAGCGGGA TGTCACAGGG GTATTATCTT
GATGTCAGCT TACTGGACAA TGTGACGCTT TATGACAGTT TTGTGCCGGT TGAATTCGGT
CGCTTCAATG GCGGGGTAAT TGATGCAAAG ATCAAACGCT TCAACGCTGA TGATAGCAGC
GTGAAACTGG GTTATCGCAC TACGCGTTCG GACTGGTTAA CATCGCATAT CGATGAGAAT
AACAAGAGCG CATTTAATCA AGGTTCTTCA GGAAGTACTT ATTACTCTCC AGATTTTAAA
AAGAACTTTT ATACCTTGTC GTTTAATCAG GAACTCGCTG ATAACTTTGG CGTTACCGCC
GGTTTATCGC GCCGCCAGTC TGATATCACC CGCGCGGATT ATGTTTCGAA TGACGGCATT
GTCGCCGGCC GGGCACAGTA TAAAAACGTT ATCGATACTG CATTGAGCAA ATTTACCTGG
TTTGCCAGCG ACCGCTTTAC CCACGATTTA ACCTTAAAAT ATACCGGCTC CAGCCGTGAT
TATAATACCA GCACCTTCCC GGAGTCTGAT CGCGAAATGG GTAATAAATC CTATGGTCTG
GCATGGGATA TGGATACCCA GCTCGCATGG GCCAAACTGC GTACCACCGT TGGTTGGGAT
CATATTAGTG ATTATACCCG TCACGATCAT GACATCTGGT ACACCGAACT TTCATGTACA
TATGGTGATA TTACAGGGCG TTGCACCCGT GGCGGATTAG GACACATTTC CCAGGCTGTA
GATAATTACA CCTTCAAAAC ACGCCTGGAC TGGCAAAAAT TCACCGTGGG TAATGTTTCG
CATCAACCCT ACTTCGGCGC GGAATACATC TATTCCGATG CATGGACTGA ACGCCATAAC
CAGTCTGAAT CCTATGTGAT TAATGCTGCC GGAAAGAAAA CTAACCATAC CATTTACCAT
AAAGGTAAAG GCAGCCTGGG AATTGACAAC TACACGCTGT ATATGGCGGA TCGCATTAGC
TGGCGTAATG TGTCATTAAT GCCCGGCGTG CGGTATGACT ATGACAACTA TCTGTCAAAC
CACAATATCT CCCCGCGCTT TATGACGGAA TGGGATATTT TTGCTGATCA AACCTCAATG
ATTACCGCAG GTTATAACCG TTACTATGGC GGGAATATTC TTGATATGGG ATTACGTGAT
ATCCGCAATA GCTGGACGGA ATCGGTATCA GGCAATAAAA CCCTGACGCG TTATCAGGAT
TTGAAAACGC CTTATAACGA TGAACTGGCA ATGGGGTTGC AGCAGAAAAT CGGTAAGAAC
GTTATTGCGC GTGCAAACTA TGTTTACCGT GAAGCGCATG ATCAAATCAG TAAAAGCAGT
CGTACCGACA GCGCGACTAA AACCACCATT ACTGAATATA ACAACGACGG TAAAACCAAA
ACGCATTCGT TCAACCTCAG TTTTGAACTG GCCGAACCCC TGCATATCAG CCAGGTAGAT
ATTAACCCGC AAATTGTCTT TAGCTATATC AAGAGCAAGG GCAACTTGTC GTTAAACAAT
GGTTATGAGG AGAGCAATAC CGGTGATAAC CAGGTGGTTT ATAACGGTAA TCTGGTCTCT
TACGATAGCG TTCCAGTGGC GGATTTTAAT AATCCATTAA AGATCTCCTT AAACATGGAT
TTCACGCATC AACCGAGCGG GTTGGTGTGG GCGAATACGC TGACCTGGCA AGAAGCGCGT
AAAGCTCGCA TTATCCTGGG TAAGACGAAT GCGCAATACA TCAGCGAATA TTCAGATTAC
AAGCAGTATG TTGACGAAAA ACTGGATAGC AGCCTGACCT GGGACACCCG CTTGTCCTGG
ACGCCACAAT TTCTGAAACA ACAAAACCTG ACGTTCAGTG CCGATATTCT CAATGTACTG
GATAGCAAAA CCGCTGTTGA TACAACGAAT ACCGGTGTGG CGACCTACGC CAGTGGCCGT
ACTTTCTGGC TTGATGTCAG CATGAAATTT TAA
 
Protein sequence
MKRVLIPGVI LCGADVAQAV DDKNMYMHVF EEMTVYAPVP VPVNGNTHYT SESIERLPTG 
NGNISDLLRT NPAVRMDSTQ STSLNQGDIR PEKISIHGAS PYQNAYLIDG ISATNNLNPA
NESDASSATN ISGMSQGYYL DVSLLDNVTL YDSFVPVEFG RFNGGVIDAK IKRFNADDSS
VKLGYRTTRS DWLTSHIDEN NKSAFNQGSS GSTYYSPDFK KNFYTLSFNQ ELADNFGVTA
GLSRRQSDIT RADYVSNDGI VAGRAQYKNV IDTALSKFTW FASDRFTHDL TLKYTGSSRD
YNTSTFPESD REMGNKSYGL AWDMDTQLAW AKLRTTVGWD HISDYTRHDH DIWYTELSCT
YGDITGRCTR GGLGHISQAV DNYTFKTRLD WQKFTVGNVS HQPYFGAEYI YSDAWTERHN
QSESYVINAA GKKTNHTIYH KGKGSLGIDN YTLYMADRIS WRNVSLMPGV RYDYDNYLSN
HNISPRFMTE WDIFADQTSM ITAGYNRYYG GNILDMGLRD IRNSWTESVS GNKTLTRYQD
LKTPYNDELA MGLQQKIGKN VIARANYVYR EAHDQISKSS RTDSATKTTI TEYNNDGKTK
THSFNLSFEL AEPLHISQVD INPQIVFSYI KSKGNLSLNN GYEESNTGDN QVVYNGNLVS
YDSVPVADFN NPLKISLNMD FTHQPSGLVW ANTLTWQEAR KARIILGKTN AQYISEYSDY
KQYVDEKLDS SLTWDTRLSW TPQFLKQQNL TFSADILNVL DSKTAVDTTN TGVATYASGR
TFWLDVSMKF