Gene Mvan_5211 details


Gene Information

Locus tag: Mvan_5211
Symbol:
ID: 4644312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5579551
End bp: 5580681
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 66%
IMG OID: 639808686
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_955988
Protein GI: 120406159
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
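
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database tooling.

```python
# Values copied from the record above.
START_BP = 5579551
END_BP = 5580681


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, expected_protein_length(n))  # 1131 376
```

Under these assumptions the result agrees with the listed Gene Length (1131 bp) and Protein Length (376 aa).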


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.220934
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG TCCGCGAGAT CGACACCGGA AGCGTGATGA CGCGGTTCGC CCGCGGGTGG 
CACTGCCTCG GGCTGGCGGA CGCCTTCCGG GACGGGCGGC CGCACGGCGT CGACGCGTTC
GGCACCATGC TGGTGGTGTT CGCCGACACG GGCGGGTCGC TGCGAGTCCT CGACGGCTAC
TGCAGGCACA TGGGCGGCAA CCTGGCCCAG GGCGAGATCA AGGGCGACGA GGTCGCCTGC
CCGTTCCACG ACTGGCGCTG GGGCGGCGAC GGCAGATGCA AGCTGGTCCC CTACGCCAAA
CGCACCCCAC GCATGGCCCG AACCAGGGCC TGGCCCACCA CCGAGGTCAA CGGACAGCTG
CTGGTCTGGC ACGACCCTGA ACGGTCCAGC CCGCCGACCG AACTGATCCC GCCGACCATC
GCGGGTTACG ACGAGGGCCG CTGGTCGCCC TGGCAGTGGA GTTCGATCCT CATCGAGGGC
GCCCACTGCC GCGAGATCGT CGACAACAAC GTCGACATGG CGCACTTCTT CTATATCCAC
CACGCGTACC CGACGTACTT CAAGAACGTC ATCGAGGGAC ACACGGCCAG CCAGTTCATG
GAGTCCAAGC CGCGTCCCGA TTTCACCGCG AACCCCGAGA AGCTCTGGGA CGGAACGTAT
CTGCGATCCG AGGCGACGTA CTTCGGGCCG GCGTACATGA TCAACTGGCT GCACAACGAC
CTCGCACCGG ACTTCACCGT CGAGGTGGCG CTGATCAACT GCCACTACCC CGTCAGCCAC
AACTCGTTCA TGCTGCAATG GGGCGTGGCG GTGCAGGAGA TGCCGGGCCT GCCCGCCGAC
AAGGCGGCCA AGCTGGCCGC GGCGATGAAC CGGTCCTTCG GCGAGGGCTT CCTCGAGGAC
GTCGAGATCT GGAAGAACAA GTCCCCTATC GAGAATCCGC TGCTGACCGA GGAGGACGGA
CCGGTCTACC AGCACCGCCG GTGGTACCAG CAGTTCTACG TCGACGCAGC CGACGTGACC
GCCGACATGA CCGGCCGGTA CGAGCAGGAA GTCGACACCA CCCACGCGAA CGACCTGTGG
CAGCAGGAGG TCGAGCGCAA CATGGCGGCC CGGAAGCCGG GTTCGGTTTG A
 
Protein sequence
MTDVREIDTG SVMTRFARGW HCLGLADAFR DGRPHGVDAF GTMLVVFADT GGSLRVLDGY 
CRHMGGNLAQ GEIKGDEVAC PFHDWRWGGD GRCKLVPYAK RTPRMARTRA WPTTEVNGQL
LVWHDPERSS PPTELIPPTI AGYDEGRWSP WQWSSILIEG AHCREIVDNN VDMAHFFYIH
HAYPTYFKNV IEGHTASQFM ESKPRPDFTA NPEKLWDGTY LRSEATYFGP AYMINWLHND
LAPDFTVEVA LINCHYPVSH NSFMLQWGVA VQEMPGLPAD KAAKLAAAMN RSFGEGFLED
VEIWKNKSPI ENPLLTEEDG PVYQHRRWYQ QFYVDAADVT ADMTGRYEQE VDTTHANDLW
QQEVERNMAA RKPGSV
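
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used to produce the record: it assumes the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGACCGACG TCCGCGAGAT CGACACCGGA"))  # MTDVREIDTG
```

The first thirty bases translate to MTDVREIDTG, which matches the first block of the protein sequence. Whitespace is stripped before translation so that the 10-base blocks can be pasted in as they appear in the record.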