Gene B21_02769 details


Gene Information

Locus tag: B21_02769
Symbol: kpsS
ID: 8115947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2951617
End bp: 2952849
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 45%
IMG OID: 644848960
Product: hypothetical protein
Protein accession: YP_003000533
Protein GI: 251786229
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3562] Capsule polysaccharide export protein
TIGRFAM ID:
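
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2951617, 2952849)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```

Both values agree with the Gene Length and Protein Length fields of the record.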


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000176543
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGACA ACGCACTAAC CATTTTATTA TCGGGTAAAA AATATCTGCT ATTGCAGGGG 
CCAATGGGAC CCTTTTTCAA TGACGTCGCC GAATGGTTAG AGTCATTAGA CCGTAACGCT
GTGAATGTTG TCTTCAACGG CGGGGATCGT TTTTACTGCC GCCATCGACA ATACCTGGCT
TACTACCAAA CGCCGAAAGA GTTTCCCGGT TGGCTGCGAG ATCTCCACCG ACAATATGAC
TTTGATACCA TCCTCTGCTT TGGTGACTGC CGCCCATTGC ACAAAGAAGC AAAACGCTGG
GCAAAGTCGA AAGGGATCCG CTTTCTGGCA TTTGAAGAAG GATATTTACG CCCGCAATTT
ATAACCGTTG AAGAAGGCGG AGTGAACGCA TATTCATCGC TACCGCGCGA CCCGGATTTT
TATCGTAAGT TGCCAGATAT GCCTGCGCCG CACGTTGAGA ACTTAAAACC TTCAACGATG
AAACGTATAG GTCATGCGAT GTGGTATTAC CTGATGGGTT GGCATTACCG TCATGAGTTC
CCTCGCTACC GCCACCACAA ATCATTTTCC CCGTGGTATG AAGCTCGTTG CTGGGTTCGT
GCATACTGGC GCAAGCAACT TTACAAGGTA ACACAGCGTA AGGTATTGCC GAGGTTAATG
AATGAGCTGG ATCAGCGTTA TTATCTTGCC GTTTTGCAGG TGTATAACGA TAGCCAGATT
CGTAACCACA GCAATTATAA CGATGTGCGT GACTATATTA ATGAAGTCAT GTACTCATTT
TCGCGTAAAG CGCCGAAAGA AAGTTATTTG GTGATCAAAC ATCATCCGAT GGATCGTGGT
CACAGACTCT ATCGACCATT AATTAAGCGG TTGAGTAAGG AATATGGCTT AGGTGAGCGA
GTCATATACG TGCACGATCT CCCAATGCCG GAATTATTAC GCCACGCAAA AGCGGTGGTG
ACAATTAACA GTACGGCGGG GATCTCTGCG CTGATTCATA ACAAACCACT CAAAGTGATG
GGCAATGCCC TGTACGACAT CAAAGGCTTG ACGTATAAAG GGCATTTGCA CCAGTTCTGG
CAGGCCGATT TTAAACCGGA TATGAAACTG TTTAAGAAGT TTCGTGGGTA TTTGTTGGTT
AAGACGCAGG TTAATGGGGT TTATTATGGG GAGATGATTT TTAACAAGAT AAAACTGGAT
AAATATTTTC AGAGCTTGTC GCGTCAAGTA TGA
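
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute the GC fraction, which can then be compared with the GC content field of this record. The helper names are hypothetical, and only the first two blocks of the sequence are used as input here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


fragment = clean_sequence("ATGCAAGACA ACGCACTAAC")
print(f"{gc_content(fragment):.2%}")
```

Passing the full gene sequence through `clean_sequence` first gives the value for the whole gene.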
 
Protein sequence
MQDNALTILL SGKKYLLLQG PMGPFFNDVA EWLESLDRNA VNVVFNGGDR FYCRHRQYLA 
YYQTPKEFPG WLRDLHRQYD FDTILCFGDC RPLHKEAKRW AKSKGIRFLA FEEGYLRPQF
ITVEEGGVNA YSSLPRDPDF YRKLPDMPAP HVENLKPSTM KRIGHAMWYY LMGWHYRHEF
PRYRHHKSFS PWYEARCWVR AYWRKQLYKV TQRKVLPRLM NELDQRYYLA VLQVYNDSQI
RNHSNYNDVR DYINEVMYSF SRKAPKESYL VIKHHPMDRG HRLYRPLIKR LSKEYGLGER
VIYVHDLPMP ELLRHAKAVV TINSTAGISA LIHNKPLKVM GNALYDIKGL TYKGHLHQFW
QADFKPDMKL FKKFRGYLLV KTQVNGVYYG EMIFNKIKLD KYFQSLSRQV
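
The protein sequence corresponds to translating the gene sequence codon by codon. The sketch below builds the codon table from the usual compact 64-letter string in T, C, A, G order; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. This is an illustrative implementation under those assumptions, not the database's own tooling: it ignores alternative start codons and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCAAGACA ACGCACTAAC CATTTTATTA"))  # MQDNALTILL
```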