Gene Mvan_1001 details


Gene Information

Locus tag: Mvan_1001
Symbol:
ID: 4645786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1037042
End bp: 1038184
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 67%
IMG OID: 639804502
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_951845
Protein GI: 120402016
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
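
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 1037042
END_BP = 1038184

# Assumption: the CDS ends with one untranslated stop codon (3 bases).
STOP_CODON_LENGTH = 3


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length - STOP_CODON_LENGTH) // 3


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values match the Gene Length (1143 bp) and Protein Length (380 aa) fields listed in the record.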


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGTC TCAAGGACCC GGACTTCGAG CACACCGGCC CTGGGCGCCC GGCCGGACAC 
CTGTTGCGGC AGCGGTGGCA GCCGGTCTAC GCCTCCGAGG AGCTGGAAGC GGGACGCGCC
GTGCCGCTGA AGATCCTGCA CGAGGAGCTG ACCCTGTACC GGGGTGAGGA CGGCGTCGCC
CACATCGTCG CGGGCCGCTG CGCACATCGC GGGGTGCTGC TGGCGGTCGG CACCGTGGAA
GGCGACTGCG TGCGTTGCCG ATACCACGGC TGGCGGTACG ACGGCGCCGG TCAGTGCGTG
GACCAGCCGG CAGAGCGGCG AAGCTTCGCG GACAAGGTGC GCATCGCGAG CTATCCCGTC
GAGGAGTACT TCGGCTTCAT CTGGACCTAT CTGGGTGAGT CGCCGGTGCC GGAGCTGCCG
CGCTGGCCGG AACTGGAGGA GTACGGCCGC TTCCACGTCA TCGAACACCG GAAGTGGAAC
TACTTCCACG ATCTGGAGAA CACCGTCGAC GACGTACACC AGTACTGGGT GCACAAGACC
GGCATCTATC AGGACGACGG CAACGCCGGC CAGATCCCGG AGATGAGCGC TGAACTCGCC
GATTTCGGCC TCACCCAGAC CAGCACATTC AGCAACGGGT TCGTCCGCCG GCTCGCGCTG
CTGATGCCGA ACACCCTGTA CTTCAACTCG GGCGCCGGAG TGCTGCGCGG TTTCAAGAGC
TTCCTGTGGA ATGTGCCGAT CGACGACGAG AACCACATGA TGTTCTTTCT GTTCATCGCG
GCTCATCTGC CGCCCGACGT CGGCGCCCGG CTGGCGGCGG GCGTGCGGGA GGGCCGAAAA
TACCTGTCCC AGCTGCGGCC GGTCGACGAC ATCATCCGCG CCGTGCTCAG CGGCCGGGAA
CGCTGGGAGG ACATCGAGGA CCGCCCGGAC CAGGTGCTGA TCGAGGACGG TGTCGTCCTG
CTCGGCCAGG GGGTCCTGCC CGACCGCTCG CTCAACCGGC TCGGTAGCTC CGACGCCGCA
ATCATCCTGC TGCGCAGACT CTATGCGCGC GAACTGGCCG CGATCGAGGC CGGCCACCCC
CTGACGAAAT TCCCGACACC CGACGCCGCG GCGCTCACCC GGCTCGACAG CTCGACACCC
TGA
 
Protein sequence
MNSLKDPDFE HTGPGRPAGH LLRQRWQPVY ASEELEAGRA VPLKILHEEL TLYRGEDGVA 
HIVAGRCAHR GVLLAVGTVE GDCVRCRYHG WRYDGAGQCV DQPAERRSFA DKVRIASYPV
EEYFGFIWTY LGESPVPELP RWPELEEYGR FHVIEHRKWN YFHDLENTVD DVHQYWVHKT
GIYQDDGNAG QIPEMSAELA DFGLTQTSTF SNGFVRRLAL LMPNTLYFNS GAGVLRGFKS
FLWNVPIDDE NHMMFFLFIA AHLPPDVGAR LAAGVREGRK YLSQLRPVDD IIRAVLSGRE
RWEDIEDRPD QVLIEDGVVL LGQGVLPDRS LNRLGSSDAA IILLRRLYAR ELAAIEAGHP
LTKFPTPDAA ALTRLDSSTP