Gene EcSMS35_3324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3324 
SymbolfitA 
ID6144067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3399821 
End bp3401962 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content48% 
IMG OID641618153 
ProductTonB-dependent outer membrane ferric coprogen receptor FitA 
Protein accessionYP_001745303 
Protein GI170681849 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.992566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.497085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATGT TCACACCTTC ATTCTCAGGA CTCAAAGGTC GGGCGCTCTT TTCACTGCTT 
TTTGCGGCAC CGATGATTCA TGCAACAGAC TCTGTAACGA CCAAAGATGG CGAAACAATC
ACTGTTACAG CAGATGCAAA TACCGCAACT GAGGCAACCG ATGGTTATCA ACCTCTGAGC
ACTTCCACGG CGACATTAAC CGATATGCCG ATGCTGGATA TCCCGCAGGT GGTCAATACG
GTTAGCGATC AGGTTCTGGA AAATCAGAAT GCGACGACGC TGGATGAAGC GCTTTATAAC
GTCAGTAACG TGGTACAGAC CAATACATTA GGCGGAACTC AGGACGCCTT TGTACGTCGT
GGGTTTGGTG CTAACCGGGA TGGCTCCATC ATGACCAACG GCCTGCGAAC TGTACTTCCT
CGCAGTTTCA ACGCCGCCAC AGAACGTGTG GAAGTGCTAA AAGGTCCGGC CTCCACGCTG
TATGGCATTC TCGATCCTGG TGGACTGATT AACGTCGTGA CCAAGCGCCC GGAAAAAACA
TTCCATGGTT CGGTTTCAGC CACCTCCTCC AGTTTTGGTG GCGGCACTGG GCAACTTGAT
ATCACAGGTC CCATTGAAGG CACTCAGCTG GCGTATCGCC TTACCGGGGA AGTGCAGGAT
GAAGATTACT GGCGAAACTT CGGTAAAGAG CGCAGTACAT TTATTGCCCC GTCACTCACC
TGGTTTGGTG ATAATGCAAC AGTAACTATG CTCTATTCCC ATCGGGACTA TAAAACTCCG
TTCGATCGTG GAACGATTTT CGACCTTACG ACGAAACAGC CCGTAAACGT TGATCGAAAA
ATACGTTTTG ACGAACCGTT TAATATTACA GATGGAAAGT CCGATCTGGC GCAACTCAAC
GCAGAATATC ATCTCAATAG CCAGTGGACA GCGCGCTTTG ATTACAGCTA CAGCCAGGAT
AAATACAGCG ATAATCAGGC GCGTGTTACC GCGTATGATG CAACGACAGG AACACTGACA
CGGCGTGTTG ATGCAACTCA GGGATCTACC CAGCGTATGC ATGCTACTCG TGCGGATCTG
CAAGGGAATG TTGATATTGC CGGATTCTAT AATGAGATTC TGGGTGGGGT GTCATATGAA
TATTATGATC TTCTGCGTAC AGATATGATT CGCTGTAAAA AAGCTAAAGA TTTCAATATC
TACAACCCTG TTTATGGTAA TACCAGCAAA TGTACAACGG TTTCGGCGTC GGACAGTGAT
CAGACGATCA AACAGGAGAG CTACTCAGCG TATGCACAGG ACGCACTCTA TCTGACCGAT
AACTGGATTG CCGTCGCCGG GATCCGCTAT CAGTATTACA CGCAGTATGC GGGTAAAGGC
CGTCCTTTTA ATGTCAATAC TGACAGCCGC GATGAACAAT GGACTCCCAA ACTGGGGTTA
GTCTACAAAC TGACGCCATC GGTATCCTTA TTTGCCAATT ATTCGCAAAC ATTTATGCCG
CAGTCGTCAA TTGCCAGCTA CATTGGCGAT CTTCCACCGG AATCATCTAA TGCTTACGAA
GTCGGGGCAA AATTCGAGCT ATTCGATGGT ATCACCGCAG ATATTGCGCT GTTTGATATC
CATAAACGTA ATGTGTTGTA TACCGAAAGT GTTGGTGATG AAACCATTGC CAAAACGGCA
GGCCGTGTTC GTTCAAGAGG GGTAGAAGTC GACCTTGCGG GAGCATTAAC TGAAAACATT
AATATCATTG CCAGCTACGG CTATACCGAT GCTAAGGTTC TGGAGGATCC TGATTATGCA
GGGAAACCAT TGCCGAATGT TCCTCGTCAT ACCGGTTCGC TATTCCTGAC TTATGACATT
CATAACATGC CTGGCAATAA CACACTGACG TTTGGCGGTG GTGGACATGG CGTAAGCCGT
CGTTCGGCAA CCAATGGGGC TGACTATTAT CTGCCAGGCT ATTTCGTTGC CGATGCCTTC
GCCGCATACA AAATGAAATT GCAGTATCCG GTCACACTGC AATTAAACGT CAAAAACCTG
TTTGATAAAA CGTATTACAC CTCTTCCATC GCCACAAATA ATCTGGGCAA CCAGATTGGC
GATCCTCGTG AAGTGCAATT CACGGTGAAA ATGGAATTTT GA
 
Protein sequence
MAMFTPSFSG LKGRALFSLL FAAPMIHATD SVTTKDGETI TVTADANTAT EATDGYQPLS 
TSTATLTDMP MLDIPQVVNT VSDQVLENQN ATTLDEALYN VSNVVQTNTL GGTQDAFVRR
GFGANRDGSI MTNGLRTVLP RSFNAATERV EVLKGPASTL YGILDPGGLI NVVTKRPEKT
FHGSVSATSS SFGGGTGQLD ITGPIEGTQL AYRLTGEVQD EDYWRNFGKE RSTFIAPSLT
WFGDNATVTM LYSHRDYKTP FDRGTIFDLT TKQPVNVDRK IRFDEPFNIT DGKSDLAQLN
AEYHLNSQWT ARFDYSYSQD KYSDNQARVT AYDATTGTLT RRVDATQGST QRMHATRADL
QGNVDIAGFY NEILGGVSYE YYDLLRTDMI RCKKAKDFNI YNPVYGNTSK CTTVSASDSD
QTIKQESYSA YAQDALYLTD NWIAVAGIRY QYYTQYAGKG RPFNVNTDSR DEQWTPKLGL
VYKLTPSVSL FANYSQTFMP QSSIASYIGD LPPESSNAYE VGAKFELFDG ITADIALFDI
HKRNVLYTES VGDETIAKTA GRVRSRGVEV DLAGALTENI NIIASYGYTD AKVLEDPDYA
GKPLPNVPRH TGSLFLTYDI HNMPGNNTLT FGGGGHGVSR RSATNGADYY LPGYFVADAF
AAYKMKLQYP VTLQLNVKNL FDKTYYTSSI ATNNLGNQIG DPREVQFTVK MEF