Gene B21_00022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00022 
Symbolybl2 
ID8112760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp20630 
End bp21637 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content40% 
IMG OID644846317 
Producthypothetical protein 
Protein accessionYP_002997890 
Protein GI251783586 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0910551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCTT TATCCAATAA TCTGGGAAAT GTATCATTAA GTGCGCTTTG GCGGAATTAC 
TGGGGGCGAA GTGGAAATGC TAAAGATTAC CAATTCAGTT ACTCCAATAA CTGGCAACAC
ATTAGTTATA CTTTCTCTGC CAGCCAATCT TATGATGAAA ATAATAAAGA AGAGGAGCGT
TTTAATCTGT TTATCTCCAT TCCTTTCTAC TGGGGGGATG ATATTGCCAA AACACGTCAC
CAAATTAACT TATCGAATTC GACCTCATTT TCCAAAGATG GCTATTCCTC CAACAATACT
GGAATTACTG GCATAGCCGG TGAACATGAT CAGTTAAATT ATGGTATATA TGTTAATCAG
CAACAACAAA ATAATGATAC CTCGCTTGGT ACGAATTTAA GCTGGAGAAC TCCCATCGCC
ATAATAGATG GCAGCTATAG TCATTCTAAA AACGCCTGGC AAAGTGGTGG AAGTATTAGT
AGTGGATTAG TTGTCTGGTC CGGTGGTATT AATATCACTA ACCAGTTATC CGATACATTT
GCAATTCTGG ATGCGCCTGG ATTAGAAGGC GCGCATATTA ATGGACAAAA ATACAACCGA
ACAAACAGCA AAGGCCAGGT TGTTTACGAC CCGATTATAC CTCATCGTGA AAACCATCTG
GTACTTGATA TAGCAAACAG TGAAAGTGAA ACAGAATTGC AGGGCAATCG TCAAATTATT
GCGCCTTACC GTGGAGCAGT TTCTTATGTG CAGTTTACAA CTGACCAACG TAAGCCTTGG
TATATACAGG CACTGCGTCC CGATGGTTCG CCATTAACCT TTGGCTATGA CGTACTGGAT
CTCCAGGAAA ACAATATTGG AGTCGTTGGC CAGGGTAGTC GCCTTTTTAT TCGCGTAGAT
GAAATTCCAA CTGGCATAAA AGTTGCTCTC AATGATGAAC AGAATTTATT CTGTACTATT
ACTTTTCAAC ACGTTATCGA TGAAAACAAA ACATATATAT GCCAGTAA
 
Protein sequence
MQPLSNNLGN VSLSALWRNY WGRSGNAKDY QFSYSNNWQH ISYTFSASQS YDENNKEEER 
FNLFISIPFY WGDDIAKTRH QINLSNSTSF SKDGYSSNNT GITGIAGEHD QLNYGIYVNQ
QQQNNDTSLG TNLSWRTPIA IIDGSYSHSK NAWQSGGSIS SGLVVWSGGI NITNQLSDTF
AILDAPGLEG AHINGQKYNR TNSKGQVVYD PIIPHRENHL VLDIANSESE TELQGNRQII
APYRGAVSYV QFTTDQRKPW YIQALRPDGS PLTFGYDVLD LQENNIGVVG QGSRLFIRVD
EIPTGIKVAL NDEQNLFCTI TFQHVIDENK TYICQ