Gene EcolC_2577 details


Gene Information

Locus tag: EcolC_2577
Symbol:
ID: 6064360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2825084
End bp: 2826355
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 54%
IMG OID: 641601984
Product: Dyp-type peroxidase family protein
Protein accession: YP_001725535
Protein GI: 170020581
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01412] Tat-translocated enzyme
            [TIGR01413] Dyp-type peroxidase family
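
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2825084, 2826355)
print(length, protein_length(length))  # 1272 423
```

Both values agree with the Gene Length and Protein Length fields of the record.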


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.898799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0884608
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTATA AAGATGAAAA CGGCGTGAAT GAACCGTCAC GCCGACGTTT ACTGAAAGTG 
ATAGGTGCAC TGGCGCTGGC GGGAAGTTGT CCGGTCGCTC ATGCACAAAA AACGCAAAGT
GCGCCGGGTA CGCTTTCACC GGATGCTCGC AATGAGAAAC AGCCGTTTTA TGGTGAGCAT
CAGGCAGGGA TCCTGACGCC ACAACAGGCC GCAATGATGC TGGTGGCGTT TGATGTGCTT
GCCAGCGATA AAGCCGATCT TGAGCGGTTG TTTCGCTTGT TGACTCAGCG TTTTGCTTTT
CTGACTCAGG GCGGAGCAGC ACCAGAAACG CCAAATCCGC GCCTGCCACC ACTCGATTCC
GGCATTCTTG GCGGCTACAT TGCGCCCGAT AATCTCACCA TCACGTTATC GGTGGGTCAC
TCATTGTTTG ATGAGCGCTT TGGCCTTGCG CCACAGATGC CAAAAAAGCT GCAGAAGATG
ACGCGTTTCC CCAACGACTC GCTGGATGCG GCGTTATGTC ATGGTGATGT GTTGCTACAG
ATTTGCGCCA ACACCCAGGA CACGGTTATC CATGCGCTGC GCGATATCAT CAAACACACG
CCGGATTTGC TCAGTGTGCG CTGGAAGCGG GAAGGGTTTA TTTCCGATCA CGCGGCGCGT
AGTAAAGGCA AAGAGACGCC GATTAATTTG CTGGGTTTCA AAGACGGCAC TGCCAATCCC
GATAGCCAGA ATGATAAGTT GATGCAAAAA GTGGTGTGGG TAACGGCAGA TCAGCAGGAG
CCTGCGTGGA CAATCGGTGG CAGCTATCAG GCAGTACGCT TGATTCAGTT TCGAGTGGAA
TTTTGGGACA GAACGCCGCT GAAAGAACAG CAGACGATTT TTGGCCGTGA TAAGCAAACC
GGTGCGCCGC TGGGAATGCA GCATGAGCAT GATGTGCCTG ATTACGCCAG CGACCCGGAA
GGGAAGGTGA TCGCGCTGGA CAGCCATATC CGGCTGGCGA ATCCCCGCAC GGCGGAGAGT
GAGTCCAGCC TGATGCTGCG TCGTGGCTAC AGTTATTCAC TGGGCGTCAC CAACTCCGGG
CAACTGGATA TGGGGTTGCT GTTTGTCTGC TACCAACACG ATCTGGAAAA AGGCTTCCTG
ACAGTACAAA AAAGGCTCAA TGGCGAAGCG CTGGAGGAAT ACGTTAAACC TATCGGCGGC
GGTTATTTTT TTGCGCTGCC GGGGGTGAAG GACGCGAACG ATTATTTCGG AAGCGCGTTA
TTGCGGGTTT AA
 
Protein sequence
MQYKDENGVN EPSRRRLLKV IGALALAGSC PVAHAQKTQS APGTLSPDAR NEKQPFYGEH 
QAGILTPQQA AMMLVAFDVL ASDKADLERL FRLLTQRFAF LTQGGAAPET PNPRLPPLDS
GILGGYIAPD NLTITLSVGH SLFDERFGLA PQMPKKLQKM TRFPNDSLDA ALCHGDVLLQ
ICANTQDTVI HALRDIIKHT PDLLSVRWKR EGFISDHAAR SKGKETPINL LGFKDGTANP
DSQNDKLMQK VVWVTADQQE PAWTIGGSYQ AVRLIQFRVE FWDRTPLKEQ QTIFGRDKQT
GAPLGMQHEH DVPDYASDPE GKVIALDSHI RLANPRTAES ESSLMLRRGY SYSLGVTNSG
QLDMGLLFVC YQHDLEKGFL TVQKRLNGEA LEEYVKPIGG GYFFALPGVK DANDYFGSAL
LRV
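
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, that strips the spacing, computes GC content, and translates the DNA. It relies on the fact that translation table 11 uses the standard codon assignments for elongation; alternative start codons, which table 11 also permits, are not handled here because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G (NCBI layout).
# Table 11 shares these assignments; only its set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGCAGTATA AAGATGAAAA CGGCGTGAAT GAACCGTCAC GCCGACGTTT ACTGAAAGTG"
dna = clean_sequence(first_line)
print(translate(dna))  # MQYKDENGVNEPSRRRLLKV
print(f"GC content of this fragment: {gc_content(dna):.0%}")
```

The translated fragment matches the first twenty residues of the protein sequence in the record. Passing the full gene sequence through the same functions would allow the Protein Length and GC content fields to be checked as well.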