Gene Mvan_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2020 
Symbol 
ID4645343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2159241 
End bp2160272 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content66% 
IMG OID639805505 
Productputative oxidoreductase 
Protein accessionYP_952843 
Protein GI120403014 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03617] probable F420-dependent oxidoreductase, MSMEG_2256 family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGCTG GAGGCGGCCT GAAGGTGGAC GCGGCCGTCG TCAGTCAACT GTCGCAGGTC 
CCGGCCGCGG CCAGGAGGCT GGAACGGCGG GGCTACGACG GTTGCTGGAC GGCCGAGATC
AACCATGATC CGTTCCTGCC GCTGACGTTG GCGGCCGAGC ACACCCACTC CATCGAGCTG
GGCACCAGCA TTGCGGTGGC CTTCGCCCGA AATCCGATGA CGGTCGCCCA GATCGGCTGG
GATCTGCAGG ACTATTCGGG GGGCCGATTC ATCCTGGGAC TGGGTTCGCA GATCAAACCG
CACATCGAGA AGCGGTTCAG CATGCCGTGG AGCAAGCCCG TCGGCAGGAT GCGGGAGTTC
GTGCTGGCGC TGCGCGCGAT CTGGATGAGC TGGAGCGACG GCAGCCGGCT GGAATTCGAC
GGCGAGTTCT ACACCCACAA GTTGATGACG CCGATGTTCG TCCCGCCACG ACATCCTTAC
GGCGACCCCA GGGTGTTCGT CGCCGCCGTC GGGGACCGGA TGACCGAGAT GTGCGGCGAG
GTGGCCGACG GATTGCTCGC ACACGCCTTC TCGACGCAGC GCTACGTCCG GGAGGTCACC
ATCCCGACGT TGACGCGGGG GATCGAGCGG GCCGGACGCA CGCGCGCCGA CATCGAGGTG
GCCAGCCCAC TGTTCATCGT GACCGGCCTC GACGAGCAAC AGATGGCGGC GGCCGCGGTG
GCGACCCGCA AGCAGATCGC CTTCTACGCC TCCACCCCGG CCTACCGCAG CGTGTTGGAG
TTGCACGGCT GGGGTGATCT GCAGACCGAA CTCCACAGCC TCTCGCGTGA GGGTGACTGG
GACACGATGG GCTCGCTGAT CGACGACGGG ATGCTCGCGG AGTTCGCAGT CGTGGCTCTG
GTCGACGACG TGGTGGACAA GATCCGCGCG CGCTGCGACG GCTTGATCGA CCGTGTGCTG
GTGGGCTTTC CACCGTCCAT CGACGAGGCG ACGGTCGTCG ACCTGGTCAC CGATTTGCGC
ACTGGCGCAT GA
 
Protein sequence
MGAGGGLKVD AAVVSQLSQV PAAARRLERR GYDGCWTAEI NHDPFLPLTL AAEHTHSIEL 
GTSIAVAFAR NPMTVAQIGW DLQDYSGGRF ILGLGSQIKP HIEKRFSMPW SKPVGRMREF
VLALRAIWMS WSDGSRLEFD GEFYTHKLMT PMFVPPRHPY GDPRVFVAAV GDRMTEMCGE
VADGLLAHAF STQRYVREVT IPTLTRGIER AGRTRADIEV ASPLFIVTGL DEQQMAAAAV
ATRKQIAFYA STPAYRSVLE LHGWGDLQTE LHSLSREGDW DTMGSLIDDG MLAEFAVVAL
VDDVVDKIRA RCDGLIDRVL VGFPPSIDEA TVVDLVTDLR TGA