Gene B21_02884 details


Gene Information

Locus tag: B21_02884
Symbol: ygjD
ID: 8116449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3074816
End bp: 3075829
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 58%
IMG OID: 644849072
Product: hypothetical protein
Protein accession: YP_003000645
Protein GI: 251786341
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
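
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (the function names are illustrative and not part of the record):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS of this length, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


length = gene_length_bp(3074816, 3075829)
print(length, protein_length_aa(length))  # 1014 337
```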


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000164099
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT 
GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGC
GGCGTCGTGC CTGAACTGGC CTCCCGCGAT CATGTGCGTA AAACCGTACC GTTGATCCAG
GCGGCGCTAA AGGAGTCTGG TTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA
GGCCCTGGAT TAGTCGGCGC GCTACTGGTT GGCGCGACCG TGGGGCGTTC TCTGGCGTTT
GCCTGGGACG TTCCGGCGAT CCCTGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG
CTGGAAGATA ACCCGCCGGA ATTTCCGTTT GTTGCGCTGC TTGTTTCCGG CGGTCATACG
CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TCGGCGAGTC TATCGATGAT
GCCGCCGGGG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGG
CCGTTACTGT CGAAAATGGC GGCTCAGGGT ACTGCCGGGC GCTTTGTCTT CCCGCGTCCG
ATGACCGACC GTCCGGGGCT GGATTTCAGC TTCTCCGGCC TGAAAACCTT CGCGGCAAAT
ACCATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ATATCGCCCG CGCCTTTGAA
GATGCGGTGG TCGATACGCT GATGATTAAG TGCAAGCGGG CGCTGGATCA GACGGGCTTT
AAGCGACTGG TCATGGCGGG CGGCGTGAGT GCTAACCGTA CGTTACGGGC GAAGCTGGCT
GAAATGATGA AAAAACGCCG CGGCGAAGTG TTCTACGCGC GTCCGGAATT TTGTACTGAT
AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT
CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCGGAGT TACCGGCTGC GTAA
 
Protein sequence
MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ 
AALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWDVPAIPVH HMEGHLLAPM
LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG
PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE
DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD
NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
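
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the first line of the gene sequence could be cleaned, translated, and measured for GC content. The helper names are hypothetical, and the codon table is the standard one, whose codon-to-residue assignments translation table 11 shares (table 11 differs only in its permitted start codons).

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated with T, C, A, G at each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MRVLGIETSCDETGIAIYDD
print(round(gc_fraction(dna), 2))
```

Running the same functions over the whole gene sequence would allow the reported gene length, protein sequence, and GC content to be checked against the record.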