Gene B21_02223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02223 
SymbolyfcU 
ID8114110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2339900 
End bp2341636 
Gene Length1737 bp 
Protein Length579 aa 
Translation table11 
GC content51% 
IMG OID644848429 
Producthypothetical protein 
Protein accessionYP_003000002 
Protein GI251785698 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGACC ATTCTCTTTT TCGATTACGG ATACTTCCGT GGTGCATTGC GCTGGCAATG 
TCAGGGAGTT ATAGCAGTGT CTGGGCTGAA GACGACATTC AGTTTGATTC CCGTTTTCTG
GAATTAAAAG GCGACACGAA AATTGATCTG AAGCGTTTTT CCAGTCAGGG ATATGTTGAG
CCCGGAAAAT ACAATTTACA GGTTCAACTA AATAAACAGC CATTGGCGGA AGAGCACGAT
ATTTACTGGT ACGCTGGTGA AGATGACGCG AGCAAAACTT ATGCTTGTCT GACACCGGAA
CTGGTGGCGC AGTTTGGTTT AAAAGAAGAT GTGGCGAAAA ATCTGCAATG GAGCCACGAT
GGTAAATGCC TGAAACCCGG TCAACTGGAA GGCATGGAAA TTAAGGCTGA TTTAAGCCAG
TCCGCATTAG TCATTTCATT ACCGCAGGCT TACCTCGAAT ATACCTGGCC CGACTGGGAT
CCGCCTTCTC GTTGGGATGA TGGCATCTCC GGGATCATCG CGGACTACAG CATCACTGCG
CAAACACGAC ACGAAGAAAA TGGCGGTGAT GACTCTAACG AGATCAGCGG CAACGGGACG
GTCGGGGTTA ACCTGGGGCC GTGGCGTGTG CGTGCCGACT GGCAGACCGA CTATCAACAT
ACCCGCAGTA ATGATGATGA CGATGAATTT AGCGGCGATG ACACACAAAA AAAATGGGAG
TGGAGTCGCT ACTATGCCTG GCGGGCGTTA CCGTCATTGA AAGCCAAACT GGCGCTGGGC
GAAGATTACC TCAATTCCGA TATTTTCGAC GGTTTTAACT ATGTTGGCGG CAGTGTCAGT
ACTGACGATC AAATGTTGCC TCCCAACCTG CGTGGCTACG CGCCAGACAT TTCCGGCGTG
GCGCACACCA CAGCAAAAGT GACCGTCAGC CAGATGGGGC GTGTGATTTA CGAAACGCAG
GTTCCGGCCG GGCCGTTTCG TATTCAGGAT CTTGGTGATT CCATCTCCGG TACGTTGCAT
GTTCGCATTG AAGAACAGAA CGGCCAGGTG CAGGAATATG ACATCAGCAC CGCCTCGATG
CCATACCTTA CTCGCCCAGG CCAGGTTCGT TATAAAGTCA TGATGGGACG TCCGCAAGAG
TGGGGCCACC ATGTCGAGGG GGGATTTTTC TCTGGTGCTG AAGCCTCCTG GGGGATCGCT
AACGGTTGGT CGCTATATGG CGGCGCGCTG GGAGATAAAA ACTATCAGTC TGCGGCACTT
GGCATCGGTC GCGATTTGTC TACGTTCGGC GCGGTTGCGT TTGATGTTAC CCACTCGCAT
ACCAAACTGG ATAAAGACAC CGCTTATGGC AAAGGTTCGC TGGACGGTAA CTCCTTCCGT
GTGAGTTATT CCAAAGACTT TGACCAGCTC AACAGTCGCG TCACCTTCGC TGGATATCGC
TTCTCGGAAG AGAACTTTAT GACCATGAGC GAGTACCTGG ATGCCAGTGA CAGCGAAATG
GTCCGCACGG GCAACGACAA AGAGATGTAC ACCGCCACGT ATAACCAGAA CTTCCGCGAT
GCGGGTGTTT CGGTTTATCT CAACTATACC CGCCATACTT ACTGGGATCG CGAGGAGCAG
ACAAACTACA ACATCATGCT CTCCCACTAT TTCAATATGG GTAGCATTCG CAATATGAGC
GTTTCCCTGA CTGGCTACCG CTACGAGTAT GACAACCGGG CGGATAAGGG CATGTAC
 
Protein sequence
MPDHSLFRLR ILPWCIALAM SGSYSSVWAE DDIQFDSRFL ELKGDTKIDL KRFSSQGYVE 
PGKYNLQVQL NKQPLAEEHD IYWYAGEDDA SKTYACLTPE LVAQFGLKED VAKNLQWSHD
GKCLKPGQLE GMEIKADLSQ SALVISLPQA YLEYTWPDWD PPSRWDDGIS GIIADYSITA
QTRHEENGGD DSNEISGNGT VGVNLGPWRV RADWQTDYQH TRSNDDDDEF SGDDTQKKWE
WSRYYAWRAL PSLKAKLALG EDYLNSDIFD GFNYVGGSVS TDDQMLPPNL RGYAPDISGV
AHTTAKVTVS QMGRVIYETQ VPAGPFRIQD LGDSISGTLH VRIEEQNGQV QEYDISTASM
PYLTRPGQVR YKVMMGRPQE WGHHVEGGFF SGAEASWGIA NGWSLYGGAL GDKNYQSAAL
GIGRDLSTFG AVAFDVTHSH TKLDKDTAYG KGSLDGNSFR VSYSKDFDQL NSRVTFAGYR
FSEENFMTMS EYLDASDSEM VRTGNDKEMY TATYNQNFRD AGVSVYLNYT RHTYWDREEQ
TNYNIMLSHY FNMGSIRNMS VSLTGYRYEY DNRADKGMY