Gene Mvan_4184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4184 
Symbol 
ID4648335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4491233 
End bp4492504 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content62% 
IMG OID639807651 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_954967 
Protein GI120405138 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGT GGCCCAAGCC CGTCGAGGGC AGTTGGACGC AGCACTATCC GGACCTCGGC 
ACCGGACCGG TGTCGTTCCG GGACTCGACG TCGCCGGAGT TCTATGAGCT GGAGCGGGAA
GCCATCTTCA AACGTGCCTG GCTCAACGTC GCGCGTGTCG AGGAGCTGCC CCGGGCGGGC
AGTTACCTGA CCAAGGAGAT CGAGGTCGCG AAGACGTCGG TCATCGTGGT CAAGGGTCGC
GACGAACAGA TCCGCGCGTT CCACAACATC TGTCGGCACC GCGGAAACAA GCTGGTGTGG
AACGACTATC CGAACGAAGA GGTCAAGGGC ACCTGCAGGC AGTTCACCTG CAAGTACCAC
GGCTGGCGGT ACAACCTCAC CGGCGATCTG ACGTTCGTCC AGCAGCCTGG TGAATTCTTC
GGTCTCGACG AGAAGGACTA CGGGCTGGCG CCCGTGCACT GTGATGTGTG GAACGGTTTC
GTCTTCATCA ATTTCGATCG GGAGCCGCGT CAGACACTGC GGGAGTTCCT GGGGCCGATG
ATCACCGCGC TGGACGACTA CCCGTTCGAG TCGATGACCG AGCGCTACGA CTTCGTCGCA
CACAACAACA GCAACTGGAA GATCTTCGCC GACGCGTTCC AGGAGTACTA CCACGTACCG
TCGCTGCATT CGCAGCAGGT GCCGTCCGCG GTCCGCCAGC CCAACGCGAC GTTCGAATGC
GGCCATTTCC AGATCGACGG GCCGCACAGG CTGGTCTCCA CGGCCGGTAC CCGTCGCTGG
CTGCTGGCGC CGGAATACAT GTATCCCGTC GAAAGGGTCA CCCGCAGCGG GTTGGTCGGC
CCCTGGCGGA CACCGGAGAC GCATCAGTCC GCCGGCCTGA ACCCGGGCGG CATCGAACCG
TGGGGCATCA CCAACTTCCA GATCTTCCCG AACCTGGAAA TTCTCATCTA CCACGGCTGG
TACCTTCTGT ACCGCTACTG GCCCACGTCG CACAGCACGC ACAAGTTCGA GGCGTACAAC
GCATTCCATC CCGCCCGCAC CGTCCGGGAA CGTATCGAGC ACGAAGTCGC CTCGGTGGTT
CTCAAAGAGT TCGCCCTGCA GGACGCGGGC ATGCTCGGTG GCACCCAGGC GGCGCTCGAG
TACGGCCTGG ACGAACCGAT AGTCGACGAC TACCCACTCA ACGACCAGGA GATCCTGGTT
CGGCATCTGC ACAAGACAGC CGTCGACTGG GTGGAGGAAT ATCAGAACGA GCGTCGACCG
GTGGGGGTGT GA
 
Protein sequence
MAKWPKPVEG SWTQHYPDLG TGPVSFRDST SPEFYELERE AIFKRAWLNV ARVEELPRAG 
SYLTKEIEVA KTSVIVVKGR DEQIRAFHNI CRHRGNKLVW NDYPNEEVKG TCRQFTCKYH
GWRYNLTGDL TFVQQPGEFF GLDEKDYGLA PVHCDVWNGF VFINFDREPR QTLREFLGPM
ITALDDYPFE SMTERYDFVA HNNSNWKIFA DAFQEYYHVP SLHSQQVPSA VRQPNATFEC
GHFQIDGPHR LVSTAGTRRW LLAPEYMYPV ERVTRSGLVG PWRTPETHQS AGLNPGGIEP
WGITNFQIFP NLEILIYHGW YLLYRYWPTS HSTHKFEAYN AFHPARTVRE RIEHEVASVV
LKEFALQDAG MLGGTQAALE YGLDEPIVDD YPLNDQEILV RHLHKTAVDW VEEYQNERRP
VGV