Gene TM1040_1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1422 
Symbol 
ID4078052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1518520 
End bp1519656 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content59% 
IMG OID638006732 
Productdi-haem cytochrome c peroxidase 
Protein accessionYP_613417 
Protein GI99081263 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0511364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0579144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAAAC TCTTACGCCA TGCTTCCTGC CTTATACTAT GTGCCGCGCC ATTGGTTGCA 
GCCCCCTATC AGACAGCTGA GGATCTTGGG GAGGCGCTGT TCTTTGAGAC GGATCTGTCT
TTGAACCGCA GTCAGTCCTG TGCCACATGC CATGATCCCG CTGCGGGCTT CGTCGATCCC
AGAGGTGACG CCGAAGGCGC GTTTTCCCGT GGAGACGATG GCGTGTCGCT TGGGGGGCGT
CACGCACCCA CCGCAGCCTA TGCGTCTTTG TCGCCTGCCT TTCATCAGAA TGACGCGGGC
GAGTGGGTGG GGGGCCAGTT CTGGGATGGC CGTGCCGCTG ATTTGGCAGA GCAGGCCGGA
GGGCCGATCC TGAACCCCAT CGAACTGGGG CTTTCAAGCC AAGCGGAGGC CATCGCTCGA
CTGGCCGCAC ATCCAGAGTA TGTCGAGAGT TTTCAGACGC TCTATAGTGT CGACATCACC
ACAGATACGG AGGCGGGGTT CAAGGCGCTG ACCCAAGCGC TTGCCGCCTT TGAGAGCAGT
GACGTCTTTC AGCCTTTCGA CAGCAAGTAC GACCGCTTCT TGCGCGGGGA TGTCGAGTTG
AGCAGCGAAG AGGAGTTAGG CCGTCTCTTG TTCTTTTCGG AGCAGTTCAC CAACTGCAAC
CAATGCCACC AGTTGCGCCG CAGCGCGATT GATCCCGCAG AGCCGTTTAG CGATTTTCGC
TATCACAACA TCGGTGTTCC CGCGAATGTG GCGGGTCGCC TTGAAAACGG GGTGGCAGAG
GATTGGGTGG ACGCCGGGCT TTATGAGAAC CCCAGCGTTC TGGATCGCGC GGAGCGCGGC
AAGTTTAAGA CACCCACGCT GCGAAATGTG GCCGTGACGG GGCCCTATAT GCACAATGGG
GTGTTTCAGG ATCTGCGCAC CGTTGTGCTG TTTTACAATC GCTACAACAG CAAGGCCGCA
TCTGCACAGA TCAACCCCGA GACCGGTGCG CCCTGGGGAG AGATCCCGGT GCCGGACACA
TTGGCGCAAA AGGAGCTGAC CCATGGGCCG GCACTGGATG ATCGTCGGGT GGATGCGCTG
GTTGCTTTTC TCAAAACGCT GACCGATTCC CGTTACGAGC CCCTCCTGCA GGAGTAG
 
Protein sequence
MFKLLRHASC LILCAAPLVA APYQTAEDLG EALFFETDLS LNRSQSCATC HDPAAGFVDP 
RGDAEGAFSR GDDGVSLGGR HAPTAAYASL SPAFHQNDAG EWVGGQFWDG RAADLAEQAG
GPILNPIELG LSSQAEAIAR LAAHPEYVES FQTLYSVDIT TDTEAGFKAL TQALAAFESS
DVFQPFDSKY DRFLRGDVEL SSEEELGRLL FFSEQFTNCN QCHQLRRSAI DPAEPFSDFR
YHNIGVPANV AGRLENGVAE DWVDAGLYEN PSVLDRAERG KFKTPTLRNV AVTGPYMHNG
VFQDLRTVVL FYNRYNSKAA SAQINPETGA PWGEIPVPDT LAQKELTHGP ALDDRRVDAL
VAFLKTLTDS RYEPLLQE