Gene Mvan_5225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5225 
Symbol 
ID4644326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5595250 
End bp5596404 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content64% 
IMG OID639808700 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_956002 
Protein GI120406173 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.532387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.253584 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCG AGGCATCTGC GGGAACCGCA CATATTCGCG AGATCGACAC CGGAGCGCTG 
CCCGACCGGT ACGCCAGGGG TTGGCACTGC CTGGGTCCGG TGAAGAACTT CCTGGACGGG
AAACCGCACG GCATCGAGAT CTTCGGCACC ATGCTGGTGG TCTTCGCCGA CTCGCAGGGT
GAGCTCAACG TTCTCGACGG CTACTGCAGG CACATGGGCG GCAACCTGGC CCAGGGCGAG
ATCAAGGGCG ACGAGGTCGC CTGCCCGTTC CACGACTGGC GCTGGGGCGG CGACGGCAAG
TGCAAGCTGG TCCCCTACGC AAAGCGCACT CCGCGCCTGG CCCGCACACG GGCCTGGCAC
ACCGACGTGC GCGGCGGCCT GCTGTTCGTG TGGCACGACC ACGAGGGCAA TCCACCCCAG
CCGGAAGTCC GCATCCCGGA GATCCCGGAG TGGTCCAGCG GTGAGTGGAC CGACTGGAAG
TGGAACACGC TGCTCATCGA GGGCTCCAAC TGCCGCGAGA TCATCGACAA CGTCACCGAC
ATGGCGCACT TCTTCTACAT CCACTTCGGG CTGCCGACCT ACTTCAAGAA CGTCTTCGAA
GGTCACATCG CCAGCCAGTA CCTGCACAAC GTGGGCCGGC CCGACGTCAA CGACATGGGC
ACCGCCTACG GGGAAGCCTC CCTGGACTCC GAGGCCAGCT ACTTCGGGCC GTCGTTCATG
ATCAACTGGC TGCACAACAA GTACGGCGAC TTCAAGGCCG AATCGATCCT GATCAACTGC
CACTACCCGG TGACCCAGGA TTCCTTCGTG CTGCAGTGGG GCGTCATCGT GGAGAAGCCC
AAGGGCCTCG ACGACGGCAC CACCCAGAAG CTGGCCGACG CGTTCACCGA CGGCGTGAGC
AAAGGCTTCA TGCAGGACGT CGAGATCTGG AAGCACAAGA CCCGCATCGA CAATCCGCTG
CTGGTGGAGG AAGACGGCGC GGTCTACCAG ATGCGGCGCT GGTATCAGCA GTTCTACGTC
GATGTCGCCG ACGTGACGCC CGAGATGACC GACCGGTTCG AGATGGAAGT CGACACCACG
GCTGCGGTGC AGAAGTGGAA CGTCGAGGTC GAGGAGAACC TGAAGGCCAG GGAAACCGAG
ACGCAGTCGA CATGA
 
Protein sequence
MSTEASAGTA HIREIDTGAL PDRYARGWHC LGPVKNFLDG KPHGIEIFGT MLVVFADSQG 
ELNVLDGYCR HMGGNLAQGE IKGDEVACPF HDWRWGGDGK CKLVPYAKRT PRLARTRAWH
TDVRGGLLFV WHDHEGNPPQ PEVRIPEIPE WSSGEWTDWK WNTLLIEGSN CREIIDNVTD
MAHFFYIHFG LPTYFKNVFE GHIASQYLHN VGRPDVNDMG TAYGEASLDS EASYFGPSFM
INWLHNKYGD FKAESILINC HYPVTQDSFV LQWGVIVEKP KGLDDGTTQK LADAFTDGVS
KGFMQDVEIW KHKTRIDNPL LVEEDGAVYQ MRRWYQQFYV DVADVTPEMT DRFEMEVDTT
AAVQKWNVEV EENLKARETE TQST