Gene B21_01028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01028 
SymbolycdB 
ID8115987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1087067 
End bp1088338 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content54% 
IMG OID644847288 
Producthypothetical protein 
Protein accessionYP_002998861 
Protein GI251784557 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.250617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTATA AAGATGAAAA CGGCGTGAAT GAACCGTCAC GCCGACGTTT ACTGAAAGTG 
ATAGGTGCAC TGGCGCTGGC GGGAAGTTGT CCGGTCGCTC ATGCACAAAA AACGCAAAGT
GCGCCGGGTA CGCTTTCACC GGATGCTCGC AATGAGAAAC AGCCGTTTTA TGGTGAGCAT
CAGGCAGGGA TCCTGACGCC ACAACAGGCC GCAATGATGC TGGTGGCGTT TGATGTGCTT
GCCAGCGATA AAGCCGATCT TGAGCGGTTG TTTCGCTTGT TGACTCAGCG TTTTGCTTTT
CTGACTCAGG GCGGAGCAGC ACCAGAAACG CCAAATCCGC GCCTGCCACC ACTCGATTCC
GGCATTCTTG GCGGCTACAT TGCGCCCGAT AATCTCACCA TCACGTTATC GGTGGGTCAC
TCATTGTTTG ATGAGCGCTT TGGCCTTGCG CCACAGATGC CAAAAAAGCT GCAGAAGATG
ACGCGTTTCC CCAACGACTC GCTGGATGCG GCGTTATGTC ATGGTGATGT GTTGCTACAG
ATTTGCGCCA ACACCCAGGA CACGGTTATC CATGCGCTGC GCGATATCAT CAAACACACG
CCGGATTTGC TCAGTGTGCG CTGGAAGCGG GAAGGGTTTA TTTCCGATCA CGCGGCGCGT
AGTAAAGGCA AAGAGACGCC GATTAATTTG CTGGGTTTCA AAGACGGCAC TGCCAATCCC
GATAGCCAGA ATGATAAGTT GATGCAAAAA GTGGTGTGGG TAACGGCAGA TCAGCAGGAG
CCTGCGTGGA CAATCGGTGG CAGCTATCAG GCAGTACGCT TGATTCAGTT TCGAGTGGAA
TTTTGGGACA GAACGCCGCT GAAAGAACAG CAGACGATTT TTGGCCGTGA TAAGCAAACC
GGTGCGCCGC TGGGAATGCA GCATGAGCAT GATGTGCCTG ATTACGCCAG CGACCCGGAA
GGGAAGGTGA TCGCGCTGGA CAGCCATATC CGGCTGGCGA ATCCCCGCAC GGCGGAGAGT
GAGTCCAGCC TGATGCTGCG TCGTGGCTAC AGTTATTCAC TGGGCGTCAC CAACTCCGGG
CAACTGGATA TGGGATTGCT GTTTGTCTGC TACCAACACG ATCTGGAAAA AGGCTTCCTG
ACAGTACAAA AAAGGCTCAA TGGCGAAGCG CTGGAGGAAT ACGTTAAACC TATCGGCGGC
GGTTATTTTT TTGCGCTGCC GGGGGTGAAG GACGCGAACG ATTATTTCGG AAGCGCGTTA
TTGCGGGTTT AA
 
Protein sequence
MQYKDENGVN EPSRRRLLKV IGALALAGSC PVAHAQKTQS APGTLSPDAR NEKQPFYGEH 
QAGILTPQQA AMMLVAFDVL ASDKADLERL FRLLTQRFAF LTQGGAAPET PNPRLPPLDS
GILGGYIAPD NLTITLSVGH SLFDERFGLA PQMPKKLQKM TRFPNDSLDA ALCHGDVLLQ
ICANTQDTVI HALRDIIKHT PDLLSVRWKR EGFISDHAAR SKGKETPINL LGFKDGTANP
DSQNDKLMQK VVWVTADQQE PAWTIGGSYQ AVRLIQFRVE FWDRTPLKEQ QTIFGRDKQT
GAPLGMQHEH DVPDYASDPE GKVIALDSHI RLANPRTAES ESSLMLRRGY SYSLGVTNSG
QLDMGLLFVC YQHDLEKGFL TVQKRLNGEA LEEYVKPIGG GYFFALPGVK DANDYFGSAL
LRV