Gene Mflv_4209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_4209 
Symbol 
ID4975522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp4467274 
End bp4468482 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content70% 
IMG OID640458434 
ProductDyp-type peroxidase family protein 
Protein accessionYP_001135466 
Protein GI145224788 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAGC CGTCCCGTGC GCGGGGCTTC AATCGGCGGC GCCTCCTCCT CACCGGGGGC 
GCCGCGCTGG CGGCCACCAC CACGTTGACG CGCTGCGGCA GAGCTGAATC AGCTACAGCA
GCAACCGGTT TCGGATCCAA TAGCGAGCCG TTCCACGGGG CACATCAAGG CGGTGTGGAC
ACACCCGCCC AGGCGCACGC GTTGTTCGTC GCGTTGGATC TCGTCGCGAC CCCGGATCGT
GCACCGCGCG ACACGCTCGC GTCGGTCCTG CGGTTGTGGA CCGCCGACGC GGCACGGTTG
ACCCAGGGCA GGCCGGCACT GGCCGACACC GAGCCCGAAC TCGCCGCGCG GCCGTCCCGG
CTGACGGTCA CCGTCGGCCT CGGGGCGGGC CTGTTCGAGC GGGCGGGACT GTCCCACCGC
CGGCCCGATT CGGTGACCGA GGTGCCCGCC TTCAGCACCG ACCGTCTCGA ATCACGCTGG
TGCGGAGGCG ATCTCCTGCT GCAGATCTGT GCCGACGATC CGCTGGTCGT CGCCCACACG
GCGCGGGTGT TGCTCAAGAA CGTCCGGACC ATGACAACCC AGCGCTGGCA GCAACGGGGA
TTCCGCAGTG CCCGGGGCTC GGACACCTCG GGCGCGACCA TGCGCAATCT GATGGGCCAG
GTCGACGGTA CGGTCAACCT GCGGACGGCC ACCGACGTGG ACCGCCTGGT CTGGGACGAC
GGAGCCGGAC AGCCCTGGTT CGCCGGCGGC ACGGTGCTCG TGCTGAGGCG CATCCGCACC
GAGCTGGACA GCTGGGACGA ACTCGACCGC ACCAGCAGGG AACTGACTGT CGGCCGTCGA
CTCGATACCG GCGCTCCCCT GACCGGTACG CACGAGTTCG ACGAGCCCGA TCTGGACGCC
GCGGAAAACG GTATCCCGGT CATCCCGCCC AATTCGCATG TCGCGCTGGC CCGGCACCGC
CACGACGGGG AGAGGTTCCT GCGCCGGGCC TACAACTATG ACGATCCGCC GACCGACGCC
ACCACCGACG CGGGCCTCAT CTTCGCCGCC TTCCAACGGG ATCCGGCGCA GCAGTTCGTT
CCCGTCCAGC AGCGGTTGGC AGCGTCCGAC GCACTCAACC CGTGGATCAC CACCATCGGC
TCGGCGGTGT TCGCGATCCT GCCGGGGGCC GCCGACGGCG GCTACCTGGG GCAGAGCCTG
CTCGAATAG
 
Protein sequence
MGEPSRARGF NRRRLLLTGG AALAATTTLT RCGRAESATA ATGFGSNSEP FHGAHQGGVD 
TPAQAHALFV ALDLVATPDR APRDTLASVL RLWTADAARL TQGRPALADT EPELAARPSR
LTVTVGLGAG LFERAGLSHR RPDSVTEVPA FSTDRLESRW CGGDLLLQIC ADDPLVVAHT
ARVLLKNVRT MTTQRWQQRG FRSARGSDTS GATMRNLMGQ VDGTVNLRTA TDVDRLVWDD
GAGQPWFAGG TVLVLRRIRT ELDSWDELDR TSRELTVGRR LDTGAPLTGT HEFDEPDLDA
AENGIPVIPP NSHVALARHR HDGERFLRRA YNYDDPPTDA TTDAGLIFAA FQRDPAQQFV
PVQQRLAASD ALNPWITTIG SAVFAILPGA ADGGYLGQSL LE