Gene SbBS512_E4630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4630 
SymboliutA 
ID6268448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4324100 
End bp4326298 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content53% 
IMG OID641728399 
Productferric aerobactin receptor IutA 
Protein accessionYP_001882797 
Protein GI187732552 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGCA AAAAGTATAT GCCCCGGGCT CTTGGTCCGC TGCTTCTTGT CGTGCTGTCA 
CCAGCTGTCG CCCAGCAAAA CGATGATAAT GAGATCATAG TGTCTGCCAG CCGCAGCAAT
CGAACTGTAG CGGAGATGGC GCAAACCACC TGGGTTATCG AAAATGCCGA ACTGGAGCAG
CAGATTCAGG GCGGTAAAGA GCTGAAAGAC GCACTGGCTC AGTTAATCCC CGGCCTTGAT
GTCAGCAGCC AGAGTCGAAC CAACTACGGT ATGAACATGC GTGGCCGCCC GCTGGTTGTC
CTGATTGACG GTGTGCGCCT CAACTCTTCA CGTTCCGACA GCCGACAACT GGACTCTGTC
GATCCTTTTA ATATCGACCA TATTGAAGTG ATCTCCGGCG CGACGGCCCT GTACGGTGGC
GGGAGTACCG GAGGGTTGAT CAACATCGTG ACCAAAAAAG GTCAGCCGGA AACCATGATG
GAGTTTGAGG CTGGCACAAA AAGTGGCTTT AACAGCAGTA AAGATCACGA TGAGCGCATT
GCCGGTGCTG TCTCCGGCGG AAATGACCAT ATCTCCGGAC GTCTTTCCGT GGCATATCAG
AAATTTGGCG GCTGGTTTGA CGGTAACGGC GATGCCACCC TGCTTGATAA CACCCAGACC
GGCCTGCAGC ACTCCAATCG GCTGGACATC ATGGGAACCG GTACGCTGAA CATCGATGAA
TCCCGGCAGC TTCAACTGAT AACGCAGTAC TATAAAAGTC AGGGAGACGA CAATTACGGG
CTTAATCTCG GGAAAGGCTT TTCCGCCATC AGCGGGAGCA GCACACCATA CGTCAGTAAG
GGGCTGAATT CTGACCGCAT TCCCGGCACT GAGCGGCATT TGATCAGCCT GCAGTACTCT
GACAGTGATT TCCTGGGACA GGAACTGGTC GGTCAGGTTT ACTACCGCGA TGAGTCGTTG
CGGTACTACC CGTTCCCGAC GGTAAATGCG AATAAACAGG CGACGGCTTT CTCCTCGTCA
CAGCAGGATA CCGACCAGTA CGGCATGAAA CTGACTCTGA ACAGCCAACT GATGGACGGC
TGGCAAATCA CCTGGGGGCT GGATGCTGAG CATGAGCGCT TTACCTCCAA CCAGATGTTC
TTCGATCTGG CTCAGGCAAG TGCTTCCGGA GGGCTGAACA ACCATAAGAT TTACACCACC
GGGCGCTATC CGTCATATGA CATCACCAAT CTGGCGGCCT TCCTGCAATC CAGCTATGAC
ATTAATGATA TTTTTACCGT TAGCGGTGGC GTACGCTATC AGTATACTGA GAACAGGGTA
GATGATTTCA TCGACTACAC GCAGCAACAG AAGATTGCTG CCGGGAAGGC GATATCTGCC
GACGCCATTC CTGGTGGTTC GGTAGATTAC GATAACTTTC TGTTCAATGC TGGTCTGCTG
ATGCACATCA CCGAACGTCA GCAGGCATGG TTCAATTTTT CCCAGGGGGT GGCATTGCCG
GATCCGGGGA AATATTATGG TCGCGGCATC TATGGTGCAG CAGTGAACGG CCATCTTTCC
CTGACAAAGA GCGTGAACGT CAGCGACAGT AAGCTGGAAG GCGTGAAAGT CGATTCTTAT
GAACTGGGCT GGCGCTTTAC CGGTGACAAC CTGCGGACTC AAATCGCGGC ATATTACTCG
CTTTCCAATA AGAGCGTGGA AAGGAATAAA GATCTGACCA TCAGTGTGAA GGACGACAGG
CGCCGTATTT ACGGCGTGGA AGGTGCAGTG GACTACCTGA TCCCGGATAC TGACTGGAGT
ACCGGTGTGA ACTTCAATGT GCTGAAAACC GAGTCGAAAG TGAACGGTCA ATGGCAAAAA
TATGACGTGA AGGAATCAAG TCCATCGAAA GCGACAGCTT ACATTAACTG GGCGCCGGAA
CCGTGGAGTC TGCGTGTACA GAGCACCACT TCTTTCGACG TAAGCGATGC AGAGGGTAAC
GATATTAATG GTTACACTAC CGTCGATTTT ATCAGTAGTT GGCAGCTTCC GGTGGGAACA
CTCAGCTTCA GCGTTGAGAA CCTCTTCGAC CGTGACTATA CCACTGTCTG GGGACAGCGT
GCACCTCTGT ACTACAGCCC GGGTTACGGC CCTGCTTCAC TGTACGACTA CAAAGGCCGG
GGCCGAACCT TTGGTCTGAA CTACTCAGTG CTGTTCTGA
 
Protein sequence
MMRKKYMPRA LGPLLLVVLS PAVAQQNDDN EIIVSASRSN RTVAEMAQTT WVIENAELEQ 
QIQGGKELKD ALAQLIPGLD VSSQSRTNYG MNMRGRPLVV LIDGVRLNSS RSDSRQLDSV
DPFNIDHIEV ISGATALYGG GSTGGLINIV TKKGQPETMM EFEAGTKSGF NSSKDHDERI
AGAVSGGNDH ISGRLSVAYQ KFGGWFDGNG DATLLDNTQT GLQHSNRLDI MGTGTLNIDE
SRQLQLITQY YKSQGDDNYG LNLGKGFSAI SGSSTPYVSK GLNSDRIPGT ERHLISLQYS
DSDFLGQELV GQVYYRDESL RYYPFPTVNA NKQATAFSSS QQDTDQYGMK LTLNSQLMDG
WQITWGLDAE HERFTSNQMF FDLAQASASG GLNNHKIYTT GRYPSYDITN LAAFLQSSYD
INDIFTVSGG VRYQYTENRV DDFIDYTQQQ KIAAGKAISA DAIPGGSVDY DNFLFNAGLL
MHITERQQAW FNFSQGVALP DPGKYYGRGI YGAAVNGHLS LTKSVNVSDS KLEGVKVDSY
ELGWRFTGDN LRTQIAAYYS LSNKSVERNK DLTISVKDDR RRIYGVEGAV DYLIPDTDWS
TGVNFNVLKT ESKVNGQWQK YDVKESSPSK ATAYINWAPE PWSLRVQSTT SFDVSDAEGN
DINGYTTVDF ISSWQLPVGT LSFSVENLFD RDYTTVWGQR APLYYSPGYG PASLYDYKGR
GRTFGLNYSV LF