Gene EcSMS35_3180 details

Gene Information

Locus tag: EcSMS35_3180
Symbol: iucC
ID: 6147083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3260790
End bp: 3262532
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 54%
IMG OID: 641618020
Product: aerobactin siderophore biosynthesis protein IucC
Protein accession: YP_001745170
Protein GI: 170679922
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4264] Siderophore synthetase component
TIGRFAM ID:
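
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function name is hypothetical and not part of any database tooling.

```python
def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3260790, 3262532

# Assumption: coordinates are inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1
protein_length = expected_protein_length(gene_length)

print(gene_length, protein_length)  # 1743 580
```

Both values agree with the Gene Length and Protein Length fields listed in the record.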


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCACA AAGACTGGGA TTTGGTCAAC CGCCGCCTGG TGGCAAAAAT GTTGTCTGAG 
CTGGAGTATG AGCAGGTTTT CCACGCGGAA TCTCAGGGCG ATGACCGCTA CTGCATTAAC
CTGCCGGGAG CACAATGGCG CTTCATCGCT GAACGTGGTA TCTGGGGCTG GCTCTGGATT
GATGCTCAAA CTCTGCGCTG CGCGGACGAG CCAGTACTGG CTCAGACGCT GCTGATGCAG
CTAAAGCAGG TACTGTCAAT GAGCGATGCA ACCGTTGCTG AGCATATGCA GGATTTGTAT
TCCACGCTGC TGGGCGACCT GCAACTGCTG AAAGCCCGTC GCGGGCTGAG CGCCAGTGAC
CTGATTAATC TTAGTGCCGA CCGCCTGCAA TGCCTGCTGA GCGGTCATCC TAAATTCGTT
TTTAATAAAG GTCGCCGTGG CTGGGGTAAA GAGGCGCTGG AACGATATGC GCCAGAGTAT
GCCAACACCT TTAGACTGCA CTGGCTGGCG GTAAAACGTG AACATATGAT CTGGCGCTGT
GATAACGAGA TGGATATTCA TCAGTTGTTG ACGGCCGCAA TGGATCCGCA GGAGTTTGCC
CGCTTCAGTC AGGTCTGGCA GGAAAACGGA CTGGATCATA ACTGGCTGCC GCTGCCTGTA
CATCCGTGGC AGTGGCAGCA AAAAATCGCC ACCGACTTCA TCGCTGATTT TGCCGAAGGC
AGGATGGTGT CTCTCGGCGA GTTTGGCGAC CAGTGGCTCG CCCAGCAGTC GCTGCGTACC
CTGACCAACG CCAGTCGCCG GGGTGGGCTG GATATCAAGC TGCCTCTGAC CATCTACAAC
ACCTCATGCT ACCGGGGGAT TCCTGGCAGA TACATCGCTG CCGGACCACT GGCTTCACGC
TGGCTACAAC AGGTTTTTGC GACCGACGCT ACCCTGGTGC AAAGCGGTGC AGTGATCCTT
GGTGAACCGG CTGCAGGCTA TGTCTCCCAT GAAGGCTATG CCGCGCTTGC CCGGGCTCCC
TATCGCTACC AGGAAATGCT TGGTGTTATC TGGCGGGAGA ATCCGTGCCG CTGGCTGAAA
CCGGATGAAA GTCCGGTTCT GATGGCAACA CTGATGGAGT GCGACGAAAA CGATCAGCCG
CTGGCAGGCG CATATATAGA CCGCTCCGGG CTGGACGCTG AAACCTGGCT TACGCAACTG
TTCCGGGTGG TCGTGGTTCC TCTGTATCAC CTGCTTTGCC GCTACGGTGT CGCGCTTATT
GCACATGGAC AAAATATAAC GCTCGCCATG AAAGAGGGGG TTCCACAGCG TGTTCTGCTG
AAAGACTTCC AGGGCGATAT GCGGCTGGTG AAAGAAGAGT TCCCCGAAAT GGACTCTTTG
CCTCAGGAGG TTCGTGATGT TACATCCCGC CTGAGTGCGG ACTACTTAAT CCATGATTTG
CAGACGGGTC ACTTCGTGAC AGTACTGCGT TTTATTTCGC CACTGATGGT TCGTCTTGGC
GTACCTGAAA GGCGATTTTA TCAACTGCTG GCAGCAGTGT TGAGTGATTA CATGAAAAAA
CATCCACAAA TGTCAGAGCG TTTTGCGCTT TTCTCACTTT TCAGGCCACA AATCATTCGC
GTGGTGCTGA ACCCGGTAAA ACTGACCTGG CCTGATCTGG ATGGCGGCAG CCGCATGCTG
CCGAATTACC TTGAGGATCT GCAAAATCCG CTGTGGCTGG TAACTCAGGA ATATGAATCA
TGA
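
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any calculation. As a minimal sketch (the helper names are hypothetical), the following strips the whitespace and computes the GC fraction, which a reader could run on the full sequence to compare against the GC content field of the record. Only the first block of the sequence is used here as a small demonstration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10-base block of the gene sequence shown above.
first_block = "ATGAATCACA"
print(round(gc_content(first_block), 2))  # 0.3
```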
 
Protein sequence
MNHKDWDLVN RRLVAKMLSE LEYEQVFHAE SQGDDRYCIN LPGAQWRFIA ERGIWGWLWI 
DAQTLRCADE PVLAQTLLMQ LKQVLSMSDA TVAEHMQDLY STLLGDLQLL KARRGLSASD
LINLSADRLQ CLLSGHPKFV FNKGRRGWGK EALERYAPEY ANTFRLHWLA VKREHMIWRC
DNEMDIHQLL TAAMDPQEFA RFSQVWQENG LDHNWLPLPV HPWQWQQKIA TDFIADFAEG
RMVSLGEFGD QWLAQQSLRT LTNASRRGGL DIKLPLTIYN TSCYRGIPGR YIAAGPLASR
WLQQVFATDA TLVQSGAVIL GEPAAGYVSH EGYAALARAP YRYQEMLGVI WRENPCRWLK
PDESPVLMAT LMECDENDQP LAGAYIDRSG LDAETWLTQL FRVVVVPLYH LLCRYGVALI
AHGQNITLAM KEGVPQRVLL KDFQGDMRLV KEEFPEMDSL PQEVRDVTSR LSADYLIHDL
QTGHFVTVLR FISPLMVRLG VPERRFYQLL AAVLSDYMKK HPQMSERFAL FSLFRPQIIR
VVLNPVKLTW PDLDGGSRML PNYLEDLQNP LWLVTQEYES
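
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below is an illustration rather than the database's own method: it builds the codon table from the usual compact TCAG ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons) and translates the first 30 bases of the gene. The function name `translate` is an assumption made for this example.

```python
from itertools import product

# Codon table in TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
gene_start = "ATGAATCACA AAGACTGGGA TTTGGTCAAC"
print(translate(gene_start))  # MNHKDWDLVN
```

The result matches the first block of the protein sequence, MNHKDWDLVN.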