Gene ECD_00021 details

Gene Information

Locus tag: ECD_00021
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 20630
End bp: 21775
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: putative usher protein
Protein accession: ACT41923
Protein GI: 253976253
COG category:
COG ID:
TIGRFAM ID:
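
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 20630
end_bp = 21775
reported_gene_length_bp = 1146
reported_protein_length_aa = 381

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no amino acid.
computed_protein_length_aa = computed_gene_length_bp // 3 - 1

print(computed_gene_length_bp == reported_gene_length_bp)        # True
print(computed_protein_length_aa == reported_protein_length_aa)  # True
```

Under these assumptions the reported 1146 bp and 381 aa are consistent with the listed coordinates.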


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.102676
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCCGCCT GGCGTTATGC TTCGCAGGAT TACAGGACAT TCAGCGACCA TCTTTACGAA 
AATGATAAAC ACTATCATCA GAGTGATTAT AATGATTTTT ATGATATTGG CAGAAAAAAT
AGCCTTTCTG CCAATATTAT GCAACCTTTA TCCAATAATC TGGGAAATGT ATCATTAAGT
GCGCTTTGGC GGAATTACTG GGGGCGAAGT GGAAATGCTA AAGATTACCA ATTCAGTTAC
TCCAATAACT GGCAACACAT TAGTTATACT TTCTCTGCCA GCCAATCTTA TGATGAAAAT
AATAAAGAAG AGGAGCGTTT TAATCTGTTT ATCTCCATTC CTTTCTACTG GGGGGATGAT
ATTGCCAAAA CACGTCACCA AATTAACTTA TCGAATTCGA CCTCATTTTC CAAAGATGGC
TATTCCTCCA ACAATACTGG AATTACTGGC ATAGCCGGTG AACATGATCA GTTAAATTAT
GGTATATATG TTAATCAGCA ACAACAAAAT AATGATACCT CGCTTGGTAC GAATTTAAGC
TGGAGAACTC CCATCGCCAT AATAGATGGC AGCTATAGTC ATTCTAAAAA CGCCTGGCAA
AGTGGTGGAA GTATTAGTAG TGGATTAGTT GTCTGGTCCG GTGGTATTAA TATCACTAAC
CAGTTATCCG ATACATTTGC AATTCTGGAT GCGCCTGGAT TAGAAGGCGC GCATATTAAT
GGACAAAAAT ACAACCGAAC AAACAGCAAA GGCCAGGTTG TTTACGACCC GATTATACCT
CATCGTGAAA ACCATCTGGT ACTTGATATA GCAAACAGTG AAAGTGAAAC AGAATTGCAG
GGCAATCGTC AAATTATTGC GCCTTACCGT GGAGCAGTTT CTTATGTGCA GTTTACAACT
GACCAACGTA AGCCTTGGTA TATACAGGCA CTGCGTCCCG ATGGTTCGCC ATTAACCTTT
GGCTATGACG TACTGGATCT CCAGGAAAAC AATATTGGAG TCGTTGGCCA GGGTAGTCGC
CTTTTTATTC GCGTAGATGA AATTCCAACT GGCATAAAAG TTGCTCTCAA TGATGAACAG
AATTTATTCT GTACTATTAC TTTTCAACAC GTTATCGATG AAAACAAAAC ATATATATGC
CAGTAA
 
Protein sequence
MAAWRYASQD YRTFSDHLYE NDKHYHQSDY NDFYDIGRKN SLSANIMQPL SNNLGNVSLS 
ALWRNYWGRS GNAKDYQFSY SNNWQHISYT FSASQSYDEN NKEEERFNLF ISIPFYWGDD
IAKTRHQINL SNSTSFSKDG YSSNNTGITG IAGEHDQLNY GIYVNQQQQN NDTSLGTNLS
WRTPIAIIDG SYSHSKNAWQ SGGSISSGLV VWSGGINITN QLSDTFAILD APGLEGAHIN
GQKYNRTNSK GQVVYDPIIP HRENHLVLDI ANSESETELQ GNRQIIAPYR GAVSYVQFTT
DQRKPWYIQA LRPDGSPLTF GYDVLDLQEN NIGVVGQGSR LFIRVDEIPT GIKVALNDEQ
NLFCTITFQH VIDENKTYIC Q
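
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG can act as a start codon and is then read as methionine. The following is a minimal sketch of that translation step, not the tool that produced this record. The function `translate` and the constant names are hypothetical, the start codon set is only a subset of those allowed by table 11, and the codon table is written in the compact TCAG ordering.

```python
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# Translation table 11 uses the same codon-to-amino-acid mapping as the
# standard code; it differs in which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Assumption: a subset of table 11 start codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break  # stop codon: translation ends here
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiating codon is read as methionine
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "GTGGCCGCCT GGCGTTATGC TTCGCAGGAT TACAGGACAT TCAGCGACCA TCTTTACGAA"
print(translate(first_line))  # MAAWRYASQDYRTFSDHLYE
```

The printed residues match the first 20 residues of the protein sequence listed above.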