Gene Mvan_4213 details


Gene Information

Locus tag: Mvan_4213
Symbol:
ID: 4645898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4520003
End bp: 4521337
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 64%
IMG OID: 639807680
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_954996
Protein GI: 120405167
COG category: [P] Inorganic ion transport and metabolism
              [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
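
The start, end, and length fields of this record are mutually consistent, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 4520003
end_bp = 4521337

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1335 444
```

Both values match the Gene Length (1335 bp) and Protein Length (444 aa) fields.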


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.144369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACATT TCCCGAAGCC TGCCGCAGGC AGCTGGACAG AGAACTGGCC CGAGCTCGGC 
ACCGCGCCGG TCGATTACAC GGACTCCATC GACCCCGAGC AGTGGAAGCT GGAGCAGCAG
GCCATCTTCC GCAAGCTGTG GCTGCATGTC GGCCGCGTGG AGCGGCTGCC GAAGGCCGGC
AGCTACTTCA CCCGTGAGAT GCCGTCGGTC GGCGCCGGCA CCTCGATCAT CGTCAACAAG
GACAAGGACG GCCAGGTCCG CGCCTTCTAC AACCTGTGCC GGCACCGCGG CAACAAGCTG
GTCTGGAACG ACTACCCCGG CGAAGAGGTC TCCGGATCCT GCCGGCAGTT CACCTGCAAG
TACCACGCGT GGCGCTACGC GCTCGACGGT GAGCTGACCT TCATCCAGCA GGAGGACGAG
TTCTTCGACG TCGACAAGGC CGACTACCCG CTCAAGCCCG TCCGCTGCGA GGTCTGGGAA
GGCTTCATCT TCGTCAACTT CGACGACGAC GCCGAACCGC TGGTCGACTA CCTCGGCGAC
TTCGCCAAGG GGCTGGAGGG CTACCCCTTC CACGAGATGA CCGAGGTGTA CAGCTACCGC
GCCGAAATCA ACTCGAACTG GAAGCTTTTC ATCGACGCGT TCGTCGAGTT CTACCACGCA
CCGATCCTTC ACATGAAGCA GGCGACTGCC GAAGAGGCCG CCAAGCTCGC CAAGGTCGGG
TTCGAAGCCC TGCACTACGA CATCAAGGAT CAACACTCGA TGATCTCGTC GTGGGGTGGT
ATGAGCCCGC CCAAGGACCT CAACATGGTC AAGCCGATCG AGCGGATCCT GCACAGCGGC
CTGTTCGGTC CGTGGGACCG GCCCGACATC AAGGGCATCC TGCCCGACGA GCTGCCGCCT
GCCGTCAACC CGGCCCGACA GCCGACGTGG GGCCAGGACT CGTTCGAGTT CTTCCCGAAC
TTCACGCTGC TGCTGTGGGC GCCGGGCTGG TACCTGACCT ACAACTACTG GCCGACCGCC
GTGGACAAGC ACATCTTCGA GTGCAACCTG TACTTCGTGC CCCCGAAGAA CACCCGCCAG
CGGCTGTCCC AGGAACTCGC GGCCGTGACG TTCAAGGAGT ACGCACTGCA GGACGCCAAC
ACCCTCGAGG CGACGCAGAC CCAGATCGGC ACCCGCGCCG TCACCGAATT CCCGTTGTGC
GACCAAGAGA TCCTGCTGCG CCACCTGCAC CACACGGCGC ACAAGTACGT CGACAAGTAC
AAGCTCGAGC AGGCCGCGAA GGCTGCCACC AACGGATCTG CCAGCAAGAC CGAAAAGGAG
GGGGCCAATG TCTGA
 
Protein sequence
MAHFPKPAAG SWTENWPELG TAPVDYTDSI DPEQWKLEQQ AIFRKLWLHV GRVERLPKAG 
SYFTREMPSV GAGTSIIVNK DKDGQVRAFY NLCRHRGNKL VWNDYPGEEV SGSCRQFTCK
YHAWRYALDG ELTFIQQEDE FFDVDKADYP LKPVRCEVWE GFIFVNFDDD AEPLVDYLGD
FAKGLEGYPF HEMTEVYSYR AEINSNWKLF IDAFVEFYHA PILHMKQATA EEAAKLAKVG
FEALHYDIKD QHSMISSWGG MSPPKDLNMV KPIERILHSG LFGPWDRPDI KGILPDELPP
AVNPARQPTW GQDSFEFFPN FTLLLWAPGW YLTYNYWPTA VDKHIFECNL YFVPPKNTRQ
RLSQELAAVT FKEYALQDAN TLEATQTQIG TRAVTEFPLC DQEILLRHLH HTAHKYVDKY
KLEQAAKAAT NGSASKTEKE GANV
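
The protein sequence follows from the gene sequence by translation with table 11, and the GC content field is the fraction of G and C bases in the gene sequence. The sketch below illustrates both calculations under stated assumptions. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, the amino acid assignments are those of NCBI translation table 11 (the same as the standard code, from which it differs only in its permitted start codons), and translation simply stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in TCAG codon order ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_block = "ATGGCACATT TCCCGAAGCC TGCCGCAGGC"
print(translate(first_block))  # MAHFPKPAAG
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the protein sequence and the GC content field of the record.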