Gene Ent638_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1547 
Symbol 
ID5114515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1703308 
End bp1704591 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content59% 
IMG OID640491734 
ProductDyp-type peroxidase family protein 
Protein accessionYP_001176277 
Protein GI146311203 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.844225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.133092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGA AAGAAGAGTA CGGCGTCGCG GAACCTTCCC GACGTCGATT GCTGAAAGGC 
GTGGGGGCAC TGGGTGGTGC GCTCGCGCTG GCTGGCGGCT GCCCGGTAGC GTATGCGGCA
AAACCGCAGA GCGCGCCGGG TACGCTCTCC CCCAATGCGC GGATGGAAAC GCAGCCGTTT
TACGGCGAGC ATCAGGGTGG GATTTTGACG CCACAGCAGG CGGCCATGAT GGTGGTCGCC
TTTGACTCGC TGGCCAGCGA CAAAGCGGAC CTGGAACGTC TGTTCCGCGT GCTGACGAAA
CGCATCGCGT TTCTGACGGC AGGCGGCCCG GCACCCGAAA CGCCTAACCC GCGCATGCCG
CCGATGGATT CTGGCATTCT CGGGCCGTTT ATCGCCCCGG ATAACCTGAC CGTCACGGTG
TCGGTCGGCG AATCGCTCTT TGACGCTCGC TATGGTCTGG AGAAGCACAA GCCGAAAACG
CTGCAGAAGA TGACGCGTTT TCCAAATGAT TCTCTGGATG CGGCGCTGTG TCATGGCGAT
CTGCTGATCC AGATTTGCGC CAACACCCAG GATACGGTGA TCCATGCCTT GCGCGACATC
ATCAAGCACA CGCCGGATTT ACTCAGCGTG CGCTGGAAGC GTGAAGGATT TATCTCCGAT
CACGCGGCGC GAAGCCAGGG GAAAGAGACG CCGGTTAACT TGCTTGGTTT TAAAGACGGC
ACGGCGAATC CGGATAGCAC TGACGCGGCG TTGATGAAAA GCGTGGTGTG GACGACGGCG
GACCAGAGCG AACCTGCATG GGCCGTTGGT GGGAGCTATC AGGCGGTGCG GATTATCCAG
TTCCATGTTG AGTTTTGGGA TCGTACGCCG CTCAAAGAGC AACAGACGAT TTTTGGTCGT
GACAAACAAA GCGGTGCGCC GCTGGGCATG AAAAACGAGC ATGACGTGCC GGATTATGCG
CGCGACCCTG ATGGCGATAC CATTGCGCTG GACAGCCATA TTCGTCTGGC CAACCCGCGT
ACGCCGGAAA CCCAGTCGAA CCTGATGATG CGCCGTGGAT ACAGCTATTC TCTGGGCGTC
ACCAACTCCG GCCAGCTGGA CATGGGGCTG CTGTTTGTCT GCTATCAGCA CGATCTGGAA
AAAGGCTTCT TAACGGTGCA GAAACGCTTA AACGGCGAAG CGCTGGAAGA GTACATAAAA
CCTATCGGCG GCGGCTATTT CTTTGCCCTG CCAGGCGTGC GCGGTGAAAG TACGTACCTC
GCCCAAGGCC TGATCGAAGC GTAA
 
Protein sequence
MNEKEEYGVA EPSRRRLLKG VGALGGALAL AGGCPVAYAA KPQSAPGTLS PNARMETQPF 
YGEHQGGILT PQQAAMMVVA FDSLASDKAD LERLFRVLTK RIAFLTAGGP APETPNPRMP
PMDSGILGPF IAPDNLTVTV SVGESLFDAR YGLEKHKPKT LQKMTRFPND SLDAALCHGD
LLIQICANTQ DTVIHALRDI IKHTPDLLSV RWKREGFISD HAARSQGKET PVNLLGFKDG
TANPDSTDAA LMKSVVWTTA DQSEPAWAVG GSYQAVRIIQ FHVEFWDRTP LKEQQTIFGR
DKQSGAPLGM KNEHDVPDYA RDPDGDTIAL DSHIRLANPR TPETQSNLMM RRGYSYSLGV
TNSGQLDMGL LFVCYQHDLE KGFLTVQKRL NGEALEEYIK PIGGGYFFAL PGVRGESTYL
AQGLIEA