Gene Mvan_3995 details

Gene Information

Locus tag: Mvan_3995
Symbol:
ID: 4647604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4270455
End bp: 4271837
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 65%
IMG OID: 639807457
Product: hypothetical protein
Protein accession: YP_954778
Protein GI: 120404949
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
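
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4270455, 4271837)
print(length)                     # 1383
print(protein_length_aa(length))  # 460
```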


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCACTG GCAAGGACGA CGTCCGGTTC GGCGGATTCG CGGGACGGCA CGCGATCGTG 
ATCGGTGCGA GCACCACTGG CCTGGTCGCA TCAGCAGTAG CGGCAAGGCA TTTCGACACG
GTGACGACCA TCGAACGGGA CAGCTTGCCC GGCGATCCGG TGTGGCGCAA GGGTGCACCG
CAGTCGCACC ACGGCCATAT CCTGCTGCGC GGTGGGCAAA ATGTGTTCAA CCATTACTTC
CCCGGGCTGA CGGCCGATCT GTTGGCCAAG GGTTCGGTGG AGGTGGACAT GGCCAACGAC
ATCTGCTGGT ATCACTCGGG AGGCTGGAAG AAGCGGTTCG AGAGTGGGCT GACGATGCTG
TGCCAGACCA GAGGCTTCCT CGAACTGAAC CTCCGGCAAC GACTGTCGCA GGTGCCGAAT
GTGTCGGTGC TACAGAATGT CAAAGCCACG GGGTACCGGG CCAAGGGCAA CCGGATAATC
GGTGTCGAGC TCGATGACGA CACGTTCCTG GCGGCCGATC TGGTGATCGA CGCCAGCGGA
AGAAACTCCG AAACCCCGAA GCGCCTCTGT GCCCTGGGCT TCGGGGAGCC CGAAGTCAGT
GAGCTCCATG TCGATATCGG CTATTCGACG ACGCTGTTCA CTCCGCCGGC CGACGGTCGC
GACTGGAAGG CGATGCTGAT CCACTCGAGG CCGCCGGCCA CCCGGACCGC GGCGCTGTTG
CCCACCGAGG GCGGTCGCTG GATCGTGACC CTGCTCGGCT GGCAGGGCGA TTTCGCCGGG
GGAGACATCG CCGGTTTCCT GGAATGGACA AAGGGTCTGC CGGTTCCCGA CCTCTACGAG
GCGCTGAGCG AGGCGACCTG TGTCGACGAC GTGCATCGCT GGCGCTTCCC GGCGAATCTG
CGCCGCCACT ACGACCGGTT GGCTTCCGCG CCAGACGGTT TGGTGGTCAT CGGAGATGCC
AACACCTCCT TGAATCCGTT GTACGCGCAG GGGATGTCGC ACGGGGCCAT CGGCGCGAGC
ATCCTGGACC GGTGCCTGAC CGAGCAGCGG GCAGCCTCAG GCGCCGGGCG GATAGACGGC
TTCGCCAAGC GCTTCCACCG CGCCTACGGC CGATTCCTCG ACGAGTGCTG GTTCACCTCG
ACGGTGGAGG ACTACGGCGC GGTCTCCGCC GGCGGGGGTG GCCGGTTGGT GTCCAGGCTG
GCGAGCTGGT ACCTCGGAAA GGTGACGGAG ATGACCTGGC GGGACGCTGA GATCGCGCGC
GAGTTCATCG ATGTGATGCA CCTGCAGCGG GCGCCGAAGA CCCTTATGCG GCCGGCCGTC
GCGGTCAAGG CGCTGTTGCC GAGTTCACCA GCAGCACACC AGGTCCGACC GGTTCGTGCG
TAA
 
Protein sequence
MITGKDDVRF GGFAGRHAIV IGASTTGLVA SAVAARHFDT VTTIERDSLP GDPVWRKGAP 
QSHHGHILLR GGQNVFNHYF PGLTADLLAK GSVEVDMAND ICWYHSGGWK KRFESGLTML
CQTRGFLELN LRQRLSQVPN VSVLQNVKAT GYRAKGNRII GVELDDDTFL AADLVIDASG
RNSETPKRLC ALGFGEPEVS ELHVDIGYST TLFTPPADGR DWKAMLIHSR PPATRTAALL
PTEGGRWIVT LLGWQGDFAG GDIAGFLEWT KGLPVPDLYE ALSEATCVDD VHRWRFPANL
RRHYDRLASA PDGLVVIGDA NTSLNPLYAQ GMSHGAIGAS ILDRCLTEQR AASGAGRIDG
FAKRFHRAYG RFLDECWFTS TVEDYGAVSA GGGGRLVSRL ASWYLGKVTE MTWRDAEIAR
EFIDVMHLQR APKTLMRPAV AVKALLPSSP AAHQVRPVRA
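
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC-content calculation. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input. The gene begins with GTG, which translation table 11 accepts as an initiation codon read as methionine, consistent with the leading M of the protein sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as listed in the record.
first_line = "GTGATCACTG GCAAGGACGA CGTCCGGTTC GGCGGATTCG CGGGACGGCA CGCGATCGTG"
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("GTG"))  # True
print(gc_percent("GGCC AATT"))  # 50.0
```

Passing the full joined gene sequence to `gc_percent` gives a value that can be compared with the GC content field listed in the record.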