Gene EcolC_2838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2838 
Symbol 
ID6065134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3104869 
End bp3107151 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content54% 
IMG OID641602244 
Productcatecholate siderophore receptor Fiu 
Protein accessionYP_001725793 
Protein GI170020839 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4774] Outer membrane receptor for monomeric catechols 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.167931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAACA ATCGCAATTT CCCTGCCAGA CAATTTCATT CGCTCACGTT CTTTGCCGGT 
CTTTGTATTG GCATCACGCC TGTGGCTCAG GCACTCGCCG CCGAAGGGCA AACTAACGCG
GATGACACGC TGGTTGTCGA AGCATCAACG CCTTCGCTTT ATGCGCCACA ACAATCTGCC
GATCCGAAAT TCTCGCGTCC GGTAGCGGAT ACTACCCGCA CGATGACGGT AATTTCTGAA
CAAGTGATTA AAGATCAGGG CGCAACCAAC CTTACCGACG CGCTCAAAAA CGTCCCCGGC
GTGGGTGCGT TTTTTGCGGG TGAGAACGGT AACTCCACCA CTGGCGACGC CATTTATATG
CGTGGTGCCG ATACCTCTAA CAGTATTTAT ATTGATGGCA TTCGCGATAT CGGCAGCGTC
TCGCGCGACA CCTTCAATAC CGAGCAGGTC GAAGTGATTA AAGGGCCGTC CGGCACCGAC
TACGGGCGCA GCGCACCGAC AGGCTCGATC AATATGATCA GCAAGCAGCC GCGCAATGAT
TCCGGCATTG ACGCCTCCGC CAGTATTGGC AGCGCCTGGT TCCGCCGCGG CACGCTGGAC
GTCAATCAGG TCATTGGTGA TACCACCGCG GTGCGCCTGA ATGTAATGGG CGAAAAAACG
CACGATGCCG GACGCGACAA AGTCAAAAAT GAGCGTTACG GCGTCGCCCC TTCTGTCGCT
TTTGGCCTTG GTACAGCGAA TCGTTTGTAT CTTAATTATC TGCATGTCAC CCAGCACAAC
ACGCCAGACG GCGGCATTCC GACCATCGGT TTGCCGGGCT ATTCTGCCCC ATCTGCGGGA
ACGGCGGCCC TGAATCATTC CGGAAAAGTT GATACTCATA ACTTTTACGG CACGGATTCC
GATTACGACG ATTCGACCAC CGACACCGCC ACCATGCGTT TTGAGCACGA CATCAACGAT
AACACCACCA TTCGCAATAC TACCCGTTGG TCGCGCGTAA AGCAGGATTA CCTGATGACG
GCGATTATGG GCGGGGCGTC GAATATTACT CAGCCCACCA GCGATGTGAA TAGCTGGACC
TGGTCACGCA CGGCGAATAC CAAAGATGTG AGTAATAAAA TTCTCACCAA CCAGACCAAC
CTGACCTCGA CGTTCTATAC CGGTTCTATC GGTCATGATG TCAGTACCGG CGTGGAATTT
ACCCGTGAAA CGCAGACTAA CTACGGCGTT AATCCGGTGA CGTTACCCGC GGTAAATATT
TATCATCCTG ACAGCAGCAT TCATCCCGGC GGCCTGACGC GCAACGGCGC AAACGCCAAT
GGTCAGACGG ATACCTTCGC AATTTACGCC TTTGATACGC TGCAAATCAC CCGTGATTTT
GAGCTGAACG GCGGGATCCG TCTGGATAAT TATCATACTG AATATGACAG TGCCACCGCT
TGCGGCGGCA GCGGACGCGG TGCCATCACC TGCCCAACTG GTGTGGCAAA AGGTTCTCCG
GTCACCACCG TCGACACCGC CAAGTCGGGC AATCTGATGA ACTGGAAAGC CGGGGCGCTG
TATCACCTGA CGGAAAACGG CAATGTCTAT ATTAACTATG CCGTTTCCCA GCAGCCTCCG
GGCGGCAACA ACTTCGCCCT TGCGCAGTCT GGCAGCGGTA ACAGTGCCAA CCGCACCGAT
TTTAAACCGC AAAAAGCCAA CACCAGCGAG ATTGGCACCA AATGGCAGGT TCTGGATAAA
CGTCTGTTGC TCACCGCCGC GCTGTTCCGC ACTGATATCG AAAATGAAGT TGAGCAAAAT
GATGACGGAA CTTACTCGCA ATACGGTAAG AAACGCGTCG AAGGCTATGA GATATCCGTG
GCCGGGAATA TCACTCCCGC GTGGCAGGTG ATTGGCGGCT ATACCCAGCA AAAAGCAACC
ATCAAAAACG GCAAAGATGT TGCCCAGGAT GGTTCCTCAT CGCTGCCGTA TACCCCGGAG
CACGCCTTCA CCTTATGGAG CCAATATCAG GCAACCGACG ATATCTCTGT TGGCGCGGGC
GCACGCTATA TCGGCAGTAT GCATAAAGGT TCAGACGGCG CGGTGGGAAC GCCAGCGTTT
ACCGAAGGTT ACTGGGTCGC CGATGCCAAA CTGGGGTATC GAGTTAATCG CAATCTCGAC
TTCCAGCTAA ACGTTTACAA CCTGTTTGAT ACCGATTACG TCGCCTCAAT CAACAAGAGC
GGCTACCGTT ATCACCCGGG CGAGCCAAGA ACCTTCTTGC TCACAGCCAA TATGCATTTC
TGA
 
Protein sequence
MENNRNFPAR QFHSLTFFAG LCIGITPVAQ ALAAEGQTNA DDTLVVEAST PSLYAPQQSA 
DPKFSRPVAD TTRTMTVISE QVIKDQGATN LTDALKNVPG VGAFFAGENG NSTTGDAIYM
RGADTSNSIY IDGIRDIGSV SRDTFNTEQV EVIKGPSGTD YGRSAPTGSI NMISKQPRND
SGIDASASIG SAWFRRGTLD VNQVIGDTTA VRLNVMGEKT HDAGRDKVKN ERYGVAPSVA
FGLGTANRLY LNYLHVTQHN TPDGGIPTIG LPGYSAPSAG TAALNHSGKV DTHNFYGTDS
DYDDSTTDTA TMRFEHDIND NTTIRNTTRW SRVKQDYLMT AIMGGASNIT QPTSDVNSWT
WSRTANTKDV SNKILTNQTN LTSTFYTGSI GHDVSTGVEF TRETQTNYGV NPVTLPAVNI
YHPDSSIHPG GLTRNGANAN GQTDTFAIYA FDTLQITRDF ELNGGIRLDN YHTEYDSATA
CGGSGRGAIT CPTGVAKGSP VTTVDTAKSG NLMNWKAGAL YHLTENGNVY INYAVSQQPP
GGNNFALAQS GSGNSANRTD FKPQKANTSE IGTKWQVLDK RLLLTAALFR TDIENEVEQN
DDGTYSQYGK KRVEGYEISV AGNITPAWQV IGGYTQQKAT IKNGKDVAQD GSSSLPYTPE
HAFTLWSQYQ ATDDISVGAG ARYIGSMHKG SDGAVGTPAF TEGYWVADAK LGYRVNRNLD
FQLNVYNLFD TDYVASINKS GYRYHPGEPR TFLLTANMHF