Gene EcolC_3509 details


Gene Information

Locus tag: EcolC_3509
Symbol:
ID: 6068606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3830066
End bp: 3832309
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 51%
IMG OID: 641602926
Product: ferrichrome outer membrane transporter
Protein accession: YP_001726450
Protein GI: 170021496
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
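
The coordinates and lengths in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the stop codon. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length_bp(3830066, 3832309)
print(gene_len)                     # 2244
print(protein_length_aa(gene_len))  # 747
```

Both results agree with the Gene Length and Protein Length fields.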


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0167136
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTT CCAAAACTGC TCAGCCAAAA CACTCACTGC GTAAAATCGC AGTTGTAGTA 
GCCACAGCGG TTAGCGGCAT GTCTGTTTAT GCACAGGCAG CGGTTGAACC GAAAGAAGAC
ACTATCACCG TTACCGCTGC ACCTGCGCCG CAAGAAAGCG CATGGGGGCC TGCTGCAACT
ATTGCGGCGC GACAGTCAGC TACCGGCACT AAAACCGATA CGCCGATTCA AAAAGTGCCA
CAGTCTATTT CTGTTGTGAC CGCCGAAGAG ATGGCGCTGC ATCAGCCGAA GTCGGTAAAA
GAAGCGCTTA GCTACACGCC GGGTGTCTCT GTTGGTACGC GTGGCGCATC CAACACCTAT
GACCACCTGA TCATTCGCGG TTTTGCGGCA GAAGGCCAAA GCCAGAATAA CTATCTGAAT
GGCCTGAAGT TGCAGGGCAA CTTCTATAAC GATGCGGTCA TTGATCCGTA TATGCTGGAA
CGCGCTGAAA TTATGCGTGG CCCGGTTTCC GTGCTTTACG GTAAAAGCAG TCCTGGCGGT
CTGTTGAATA TGGTCAGCAA GCGTCCGACC ACCGAACCGC TCAAAGAAGT TCAGTTTAAA
GCCGGTACTG ACAGCCTGTT CCAGACTGGT TTTGACTTTA GCGATGCGCT GGATGATGAC
GGCGTTTACT CTTATCGCCT GACCGGTCTT GCGCGTTCTG CCAATGCCCA GCAGAAAGGG
TCAGAAGAGC AGCGTTATGC TATTGCACCG GCGTTCACCT GGCGTCCGGA TGATAAAACC
AATTTCACCT TCCTTTCTTA CTTCCAGAAC GAGCCGGAAA CCGGTTATTA CGGCTGGTTG
CCGAAAGAGG GAACCGTTGA GCCGCTGCCG AACGGTAAGC GTCTGCCGAC AGACTTTAAT
GAAGGGGCGA AGAACAACAC CTATTCTCGT AATGAGAAGA TGGTCGGCTA CAGCTTCGAT
CACGAATTTA ACGACACCTT TACTGTGCGT CAGAACCTGC GCTTTGCTGA AAACAAAACC
TCGCAAAACA GCGTTTATGG TTACGGCGTC TGCTCCGATC CGGCGAATGC TTACAGCAAA
CAGTGTGCGG CATTAGCGCC AGCGGATAAA GGCCATTATC TGGCACGTAA ATACGTCGTT
GATGATGAGA AGCTGCAAAA CTTCTCCGTT GATACCCAGT TGCAGAGCAA GTTTGCCACT
GGCGATATCG ACCACACCCT GCTGACCGGT GTCGACTTTA TGCGTATGCG TAATGACATC
AACGCCTGGT TTGGTTACGA CGACTCTGTG CCACTGCTCA ATCTGTACAA TCCGGTGAAT
ACCGATTTCG ACTTCAATGC CAAAGATCCG GCAAACTCCG GCCCTTACCG CATTCTGAAT
AAGCAGAAAC AAACGGGCGT TTATGTTCAG GATCAGGCGC AGTGGGATAA AGTGCTGGTC
ACCCTGGGCG GTCGTTATGA CTGGGCAGAT CAAGAATCTC TTAACCGCGT TGCCGGGACG
ACCGATAAAC GTGATGACAA ACAGTTTACC TGGCGTGGTG GTGTTAACTA CCTGTTTGAT
AATGGTGTAA CACCTTACTT CAGCTATAGC GAATCGTTTG AACCTTCTTC GCAAGTTGGG
AAGGATGGTA ATATTTTCGC ACCGTCTAAA GGTAAGCAGT ATGAAGTCGG CGTGAAATAT
GTACCGGAAG ATCGTCCGAT TGTAGTTACT GGTGCCGTGT ATAATCTCAC TAAAACAAAC
AACCTGATGG CGGACCCTGA GGGTTCCTTC TTCTCGGTTG AAGGTGGCGA GATCCGCGCT
CGTGGCGTAG AAATCGAAGC GAAAGCGGCG CTGTCGGCGA GTGTTAACGT AGTCGGTTCT
TATACTTACA CCGATGCGGA ATACACCACC GATACTACCT ATAAAGGCAA TACGCCTGCA
CAGGTGCCAA AACACATGGC TTCGTTGTGG GCTGACTACA CCTTCTTTGA CGGTCCGCTT
TCAGGTCTGA CGCTGGGCAC CGGTGGTCGT TATACTGGCT CCAGTTATGG TGATCCGGCT
AACTCCTTTA AAGTGGGAAG TTATACGGTC GTGGATGCGT TAGTACGTTA TGATTTGGCG
CGAGTCGGCA TGGCTGGCTC CAACGTGGCG CTGCATGTTA ACAACCTGTT CGATCGTGAA
TACGTCGCCA GCTGCTTTAA CACTTATGGC TGCTTCTGGG GCGCAGAACG TCAGGTCGTT
GCAACCGCAA CCTTCCGTTT CTAA
 
Protein sequence
MARSKTAQPK HSLRKIAVVV ATAVSGMSVY AQAAVEPKED TITVTAAPAP QESAWGPAAT 
IAARQSATGT KTDTPIQKVP QSISVVTAEE MALHQPKSVK EALSYTPGVS VGTRGASNTY
DHLIIRGFAA EGQSQNNYLN GLKLQGNFYN DAVIDPYMLE RAEIMRGPVS VLYGKSSPGG
LLNMVSKRPT TEPLKEVQFK AGTDSLFQTG FDFSDALDDD GVYSYRLTGL ARSANAQQKG
SEEQRYAIAP AFTWRPDDKT NFTFLSYFQN EPETGYYGWL PKEGTVEPLP NGKRLPTDFN
EGAKNNTYSR NEKMVGYSFD HEFNDTFTVR QNLRFAENKT SQNSVYGYGV CSDPANAYSK
QCAALAPADK GHYLARKYVV DDEKLQNFSV DTQLQSKFAT GDIDHTLLTG VDFMRMRNDI
NAWFGYDDSV PLLNLYNPVN TDFDFNAKDP ANSGPYRILN KQKQTGVYVQ DQAQWDKVLV
TLGGRYDWAD QESLNRVAGT TDKRDDKQFT WRGGVNYLFD NGVTPYFSYS ESFEPSSQVG
KDGNIFAPSK GKQYEVGVKY VPEDRPIVVT GAVYNLTKTN NLMADPEGSF FSVEGGEIRA
RGVEIEAKAA LSASVNVVGS YTYTDAEYTT DTTYKGNTPA QVPKHMASLW ADYTFFDGPL
SGLTLGTGGR YTGSSYGDPA NSFKVGSYTV VDALVRYDLA RVGMAGSNVA LHVNNLFDRE
YVASCFNTYG CFWGAERQVV ATATFRF
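
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record. It relies on the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two differ in which start codons are permitted). It does not handle alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGGCGCGTT CCAAAACTGC TCAGCCAAAA"))  # MARSKTAQPK
print(translate("GCAACCGCAA CCTTCCGTTT CTAA"))        # ATATFRF
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.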