Gene Mvan_6020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_6020 
Symbol 
ID4643946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6425936 
End bp6427267 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content61% 
IMG OID639809486 
Productxenobiotic compound DszA family monooxygenase 
Protein accessionYP_956780 
Protein GI120406951 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.719059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCACC TCATCACCTG GCTAATGGGC AACAGCTATC ACGCCGCGGG GTGGCGACAT 
CAGTTGGCCT GGGAGCGCAC GGCGATGCGT TTGGACGTGT TGATCGAGAT CGCAAAGATC
GCCGAGGAGG CGAAACTCGA CGCGCTGTTC GTCGCCGACG GCAACGGTGT GCGGAATATG
GACAAGGTTG GCCTCTTTGA AGCCAACACC CCCTCCGCTC GCCCCACTGT CTACGATCCC
GTCACCCTGA TGGCGGCGAT CTCGCAGCAC ACCAAACACA TCGGACTCGT CGGTACGGCG
TCCACGACCT ACGAGTCGCC CTGGGTGGTA GCGCGGCGTT TCGCTTCACT CGATCATCTG
TCGAACGGTC GGGCATCATG GAACGTCGTC ACAGGGTCAA ACCCCGGAGA CTCGGAGAAC
TTCGGATTGG CACACCACCC GGATCGTGAC AGCCGATACG CGCGCGCTGA GGAATTCGTG
TCGGCATGCA AGGCGCTCTG GAACAGCTGG GACGAGGACG CGTTCGTCGA GCGAAAAGAC
ACGGGACAGT ACCTGAACGC CAGAAAAGTT CGCGTCCCCG ACTACAAGGG CCGTCACCTA
TCCGTGAAGG GCCCACTCAA CGTGTCGCGA TCCCCACAAG GCCGTCCTGT CTTGTTCCAT
GCGGGCCAGT CCGAAGGCGG AAGACGCTTG GCCGCCCGCC ACGCGGACTG CATATTTGAG
GCAGCGGCGA GTGTGGAGGC AGCGCGAGAG TTCTATGCCG ACCTGAAGCG CCGGAGGGTT
GAAATCGGCG GCGAACCCGA CCATTTGCGG ATTATTGTCT CGGTAGCCGT CTACCTCGGC
AGAACTGAAA GCGAAGCCGT CGAGCTGTAC TCCGAGCTGA ATTCGCTCAT CAGTCCCGAT
CTCGGCGTCG ACTTCTTGGC TAAAGCTGTC TCTGAGGATC TGACCGGTTA CCCGGTCGAC
GGTCCCATGC CGGACCTCAG CGCTCCCGTC GTCGGCGGCA ACTCCATACG GGGTCAACTC
GATGCAATCG CCAAAGCAGA ACAGCTCACG GTCCGGCAGA TGTACGAACG TGTCGTTCCC
ACGATGGGCA ATACGGCTCT GATCGGAACG GCCACACAGA TCGCCGACGT CATGGAGGAG
TGGTACACCA CCGGAGCGTG CGACGGCTTC GTGCTCGGCG CTTCCATCAG CCCGTTTACC
CTTCTTCTGA TCCGTGACGA ACTCGTTCCC GAACTGCAGC GTCGAGGACT GTTTCGACGG
CACTACACGG GAAGCACGCT GCGCGAGAAT CTTGGGCTTC CCCCCGTCGA CAACTCACCG
TTCGCAACCT GA
 
Protein sequence
MMHLITWLMG NSYHAAGWRH QLAWERTAMR LDVLIEIAKI AEEAKLDALF VADGNGVRNM 
DKVGLFEANT PSARPTVYDP VTLMAAISQH TKHIGLVGTA STTYESPWVV ARRFASLDHL
SNGRASWNVV TGSNPGDSEN FGLAHHPDRD SRYARAEEFV SACKALWNSW DEDAFVERKD
TGQYLNARKV RVPDYKGRHL SVKGPLNVSR SPQGRPVLFH AGQSEGGRRL AARHADCIFE
AAASVEAARE FYADLKRRRV EIGGEPDHLR IIVSVAVYLG RTESEAVELY SELNSLISPD
LGVDFLAKAV SEDLTGYPVD GPMPDLSAPV VGGNSIRGQL DAIAKAEQLT VRQMYERVVP
TMGNTALIGT ATQIADVMEE WYTTGACDGF VLGASISPFT LLLIRDELVP ELQRRGLFRR
HYTGSTLREN LGLPPVDNSP FAT