Gene B21_02768 details

Gene Information

Locus tag: B21_02768
Symbol: kpsC
ID: 8115953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2949555
End bp: 2951582
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 51%
IMG OID: 644848959
Product: hypothetical protein
Protein accession: YP_003000532
Protein GI: 251786228
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3563] Capsule polysaccharide export protein
TIGRFAM ID:
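
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the gene record.
start_bp = 2949555
end_bp = 2951582

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2028
print(protein_length)  # 675
```

Under these assumptions the results agree with the reported gene length (2028 bp) and protein length (675 aa).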


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000859987
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGGCA TTTTCTCGTC CGGTATCTGG CGTATTCCGC ATCTGGAGAA ATTTCTGGCG 
CAACCGTGCC AGAAACTTTC TCTGCTGCGC CCTGTTCCGC AAGAAGTTGA TGCTATCGCC
GTGTGGGGAC ATCGTCCCAG CGCGGCGAAA CCAGTCGCCA TCGCCAAAGC AGCGGGAAAA
CCCGTCATTC GTCTGGAAGA TGGATTTGTG CGTTCGCTGG ATCTTGGCGT CAATGGCGAG
CCGCCGCTTT CTCTGGTGGT GGATGATTGT GGCATTTACT ACGATGCCAG CAAGCCTTCA
GCACTGGAGA AACTGGTAAA GGATAAAGCC GGAAATACAG CTCTGATAAG CCAGGCCAGA
GAAGCGATGC ACACCATCGT GACCGGGGAT TTGTCGAAAT ATAACCTGGC ACCTGCGTTT
GTGGCTGATG AGTCAGAACG TTCAGACATC GTTCTGGTTG TCGATCAGAC ATTTAATGAT
ATGTCAGTGA CGTATGGCAA TGCTGGCCCG CATGAGTTTG CTGCCATGCT GGAAGCCGCG
ATGGCGGAAA ATCCTCAAGC CGAAATTTGG GTGAAGGTGC ATCCGGATGT CCTGGAAGGA
AAGAAAACAG GTTATTTCGC TGATCTGTGC GCCACGCAAC GAGTACGTTT GATTGCCGAG
AATGTCAGCC CGCAGTCGCT GTTGCGACAC GTTTCCCGGG TTTACGTCGT GACCTCCCAA
TATGGCTTTG AAGCCTTGCT GGCAGGAAAA CCAGTAACAT GCTTCGGCCA GCCCTGGTAT
GCAGGTTGGG GCTTAACCGA CGATCGTCAT CCACAGTCCG CTTTGTTATC TGCCCGACGC
GGTTCTGCCA CGCTGGAGGA ACTTTTTGCC GCTGCATACC TGCGTTACTG TCGCTATATC
GATCTGCAAA CGGGAGAAGT AAGCGATCTA TTTACCGTGC TGCAATGGCT GCAATTACAA
CGTCGACATC TGCAACAGCG TAATGGTTAT TTATGGGCGC CAGGCTTAAC GCTGTGGAAG
TCAGCGATCC TGAAACCTTT CTTGCAAACG GCAACAAACC GGCTGAGTTT TTCACGTCGT
TGTACTGCGG CGAGCGCCTG CGTGGTATGG GGTGTAAAGG GAGAACAGCA ATGGCGAGCC
GAAGCGCAAC GAAAATCACT GCCGTTATGG CGAATGGAAG ATGGTTTTCT GCGTTCATCC
GGACTTGGCT CTGACCTGCT GCCGCCGCTA TCGTTGGTGC TGGATAAACG CGGGATCTAC
TATGACGCCA CGCGCCCCAG CGACCTGGAA GTGCTGATTA ATCACAGCCA GTTAACGCTG
GCGCAGCAGA TGCGAGCTGA AAAATTACGC CAGCGGCTAG TTGAAAGTAA ACTGAGCAAG
TACAACTTGG GGGCCGATTT CTCTCTGCCT GCCGAAGCCA AAGATAAAAA AATCATCCTG
GTGCCGGGTC AGGTAGAAGA CGATGCCTCT ATTAAAACTG GCACTGTGTC GATTAAGAGC
AACCTTGAGT TATTACGCAC AGTGCGCGAG CGTAATCCGC ACGCCTACAT TATTTATAAA
CCGCACCCGG ATGTATTAGT GGGGAATCGT AAGGGCAATA TTCCGGCTAA ATTGATCGCT
GAACTTGCCG ACTATCAGGC ACTGGACGCA GATATTATTC AATGCATTCA GCGCGCAGAT
GAAGTGCACA CCATGACATC ATTGTCCGGG TTTGAAGCGT TATTACATGG CAAGCAAGTT
CATTGTTACG GCCTGCCCTT CTATGCCGGT TGGGGTTTAA CCGCTGATGA ACATCACTGC
CCGCGCCGCG AGCGCAGATT AACGATAGCA GACTTGATCT ATCAGGCGTT GATTGTTTAT
CCAACCTATA TCCACCCAAT ACGGCTACAA CCTATTACTG TTGAAGAGGC GGCGGAATAT
TTGATCCAGA CGCCGCGCAA GCCGATGTTT ATTACCCGAA AAAAAGCGGG GCGAGTAATA
CGCTATTACC GCAAATTAAT TATGTTCTGC AAGGTCAGAT TTGGCTAA
 
Protein sequence
MIGIFSSGIW RIPHLEKFLA QPCQKLSLLR PVPQEVDAIA VWGHRPSAAK PVAIAKAAGK 
PVIRLEDGFV RSLDLGVNGE PPLSLVVDDC GIYYDASKPS ALEKLVKDKA GNTALISQAR
EAMHTIVTGD LSKYNLAPAF VADESERSDI VLVVDQTFND MSVTYGNAGP HEFAAMLEAA
MAENPQAEIW VKVHPDVLEG KKTGYFADLC ATQRVRLIAE NVSPQSLLRH VSRVYVVTSQ
YGFEALLAGK PVTCFGQPWY AGWGLTDDRH PQSALLSARR GSATLEELFA AAYLRYCRYI
DLQTGEVSDL FTVLQWLQLQ RRHLQQRNGY LWAPGLTLWK SAILKPFLQT ATNRLSFSRR
CTAASACVVW GVKGEQQWRA EAQRKSLPLW RMEDGFLRSS GLGSDLLPPL SLVLDKRGIY
YDATRPSDLE VLINHSQLTL AQQMRAEKLR QRLVESKLSK YNLGADFSLP AEAKDKKIIL
VPGQVEDDAS IKTGTVSIKS NLELLRTVRE RNPHAYIIYK PHPDVLVGNR KGNIPAKLIA
ELADYQALDA DIIQCIQRAD EVHTMTSLSG FEALLHGKQV HCYGLPFYAG WGLTADEHHC
PRRERRLTIA DLIYQALIVY PTYIHPIRLQ PITVEEAAEY LIQTPRKPMF ITRKKAGRVI
RYYRKLIMFC KVRFG
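
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the nucleotide sequence. It is an illustration, not the method used to produce this record. It assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are permitted), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the listing.
first_line = "ATGATTGGCA TTTTCTCGTC CGGTATCTGG CGTATTCCGC ATCTGGAGAA ATTTCTGGCG"
print(translate(first_line))  # MIGIFSSGIWRIPHLEKFLA
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content reported in the record.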