Gene EcSMS35_3153 details

Gene Information

Locus tag: EcSMS35_3153
Symbol: ibrA
ID: 6142771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3237518
End bp: 3238744
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 50%
IMG OID: 641618004
Product: immunoglobulin-binding regulator A
Protein accession: YP_001745154
Protein GI: 170679850
COG category: [R] General function prediction only
COG ID: [COG3969] Predicted phosphoadenosine phosphosulfate sulfotransferase
TIGRFAM ID:
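
The coordinates and lengths in the Gene Information table are mutually consistent, which a short Python sketch can confirm. It rests on two assumptions that this page does not state: that Start bp and End bp are 1-based, inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3237518
end_bp = 3238744
gene_length_bp = 1227
protein_length_aa = 408

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # prints: 1227 409 408
```

Under these assumptions the span equals the listed gene length (1227 bp), divides evenly into 409 codons, and leaves 408 coding codons, matching the listed protein length.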


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.691119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGGAA CTATACGAAA GATAATAAAA GAGGATGATG TATTTCATTC TTCTATTCGC 
CGTATTGAAT GGTTGTTTGA AACATTCTCT TCTGTCTGTT TGTCTTTTTC CGGAGGAAAA
GACTCCACTG TGCTGCTCCA TCTTGCGGCT GATGTGGCTC GCAGGAAGAA ACGTCGTTTC
TCTGTATTAT TCATTGACTG GGAAGCTCAG TATCAGTGCA CCATTGAACA TGTTCAGAAG
ATGCGGGGAA TGTACCGGGA TGTGACGGAT ACCTTTTACT GGGTGGCACT CCCCCTGACT
ACGGTAAACG GTGTCTCTCA GTTTCAGCCG GAATGGATAT GCTGGGAACC AGGTGTTGAG
TGGGTTCGTC AGCCACCAGA TGACGCTATT ACAGATATGT CGTATTTCCC ATTTTATCGG
TATGCCATGA CGTTTGAAGA ATTTGTTCCG GCATTTTCTT CCTGGTTTGC CGGTAACCGG
TGTGGAGTGG CAATACTGAC TGGTGTTCGT GCTGATGAAT CGCTCAATCG CTTTATGGGA
CTGGTGTCTC AGCGCAAACT GAGATATGCA GATGATAAAC CCTGGACCAC AGCGTCACCT
GAAGGGTTTT ATTACACCTT GTATCCGTTG TATGACTGGA AAGCCCGTGA TATATGGATA
TATAACGCCA GAACCCGGGC TATCTACAAT CCCCTGTATG ACCTGATGTA CCGTGCCGGC
GTGCCGTTAC GCAACATGCG GGTCTGTGAG CCTTTTGGCC CGGAACAGCG TAAGGGACTG
TGGCTTTACC ATGTTCTGGA GCCGGAAACC TGGGCCAGGA TGTGTGAGCG GGTGTCGGGT
GCTGCCAGCG GGGCGCTTTA TGCCAATGAG AGCGGTGCCT ATTTTGCCCT GCGTAAGCGT
ATCACGAAGC CACCTCATCA TACCTGGCGT AGCTATGCGA TGTTCCTGCT GGATGTGATG
CCGGAAAGAA CGGCAGAACA TTACCGTAAT AAAATTGCTG TCTACCTGCG CTGGTATCAG
ACGCGGGGCT TCCCGGATGA CATCCCGGAT GAACAGGAGA ATGACCTGGG GAGCCGGGAT
ATCCCGTCCT GGCGACGTAT CTGTAAGACA CTCATAAAGA ATGATTTCTG GTGTCGGACC
CTCTCCTTCA GTCCGAACAA ACCCCGGCAC TATGAACGTT ATCTGCAGCG TATGAAAGAA
AGGAGGAAGG AATGGGGGAT TCTGTGA
 
Protein sequence
MSGTIRKIIK EDDVFHSSIR RIEWLFETFS SVCLSFSGGK DSTVLLHLAA DVARRKKRRF 
SVLFIDWEAQ YQCTIEHVQK MRGMYRDVTD TFYWVALPLT TVNGVSQFQP EWICWEPGVE
WVRQPPDDAI TDMSYFPFYR YAMTFEEFVP AFSSWFAGNR CGVAILTGVR ADESLNRFMG
LVSQRKLRYA DDKPWTTASP EGFYYTLYPL YDWKARDIWI YNARTRAIYN PLYDLMYRAG
VPLRNMRVCE PFGPEQRKGL WLYHVLEPET WARMCERVSG AASGALYANE SGAYFALRKR
ITKPPHHTWR SYAMFLLDVM PERTAEHYRN KIAVYLRWYQ TRGFPDDIPD EQENDLGSRD
IPSWRRICKT LIKNDFWCRT LSFSPNKPRH YERYLQRMKE RRKEWGIL
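
The reported GC content and protein sequence can be recomputed from the gene sequence. The following is a minimal Python sketch, not the method used by the database. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. The codon table is built from the standard genetic code, whose amino acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The sketch assumes the sequence is pasted as a string containing only A, C, G, T and whitespace.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence on the page."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as listed on this page.
first_row = "ATGTCAGGAA CTATACGAAA GATAATAAAA GAGGATGATG TATTTCATTC TTCTATTCGC"
print(translate(first_row))  # prints: MSGTIRKIIKEDDVFHSSIR
```

The 20 residues printed match the first 20 residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the listed GC content and the complete protein.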