Gene Mvan_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3100 
Symbol 
ID4646856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3270943 
End bp3272133 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content68% 
IMG OID639806577 
Productalkane 1-monooxygenase 
Protein accessionYP_953908 
Protein GI120404079 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.445628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.18825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATTC AGCAGCGCTC ACCGCAGCGG CACAGCCGGG CACCGGTAGT TGCTCAGGCC 
TGGCGTGACC AGAAGCGTTA CGGCTGGCTG CTGGGCCTGG TCATCCCCAC GCTCGTGCCC
ATGTCGTGGG CAGCGGCAGC CTTGACGGGT GCGGGGGTGT TCTGGTGGTC CGGCCCCGCC
CTGATGTTCC TCGTGATCCC GACTCTGGAT TATCTGGTCG GTCCCGACGC CGACAATCCG
CCCGACAGCG CGCTCACGTG GCTGGAGAAC GACCGGTTCT ATCGCTGGGC CACCTACCTG
TACCTGCCTG CCCAGTACGT GTCCCTGATG CTGGCGTGCT GGTTGTGGAG CGGTGGCGGC
GGAGTGGCGA TGAGCGACGT CGACAAGGTC GGGCTGATGC TCACGATCGG TGGTATCGGG
GGTGTGGCGA TCAACATCGC CCACGAGCTC GGCCACCAGC GGGCGCGGTC GGAGCGCCGG
CTCAGCAAGA TCGCGCTGGC GCAGACCGGA TACGGTCACT TCTTCGTCGA ACACAACCGC
GGCCATCACG CCCGCGTCGC CACACCCGAG GATCCGGCCA GCTCACGCCT GGGTGAGAGC
ATTTACACGT TCCAGTTCCG GTCCGTCCTG GGCTCCCTGC GCTCGGCATG GAGGCTCGAG
CGCCGACGGC TGTCCCGGCA CGGGAAGTCG CCCTGGACAC TTCGCAACGA CGTGCTGAAC
TCCTGGCTCA TGACCGCGGC GCTGTTCGCG GTGCTGGTCG CCGGGTTCGG CGTGGAGGTG
CTGCCCTGGC TGCTGGGCCA GGCGGTCGTC GGGATCTGCT TGTTGGAGTC GATCAACTAT
CTCGAGCACT ACGGGCTGCG GCGGCAGCGC CGCGCCGACG GCACCTACGA GCAGGTCCGG
CCCTCGCACA GCTGGAACAG CAACTCGGTG ATCTCCAACG TCTTCCTGTT CCACCTGCAG
CGCCACTCCG ACCACCACGC CAACCCGCAT CGGCGCTACC AGGCTCTGTG CCACGCGGAC
GAGGCGCCCC AGCTGCCGTC GGGCTACGCG ACGATGGTGC TGTTGGCGCT GTTCCCGCCG
CTGTGGCGGC GCGTCATGGA CGGGCGGGTC CTCGCCCACT ACGGCGGCGA CATCCGGCTG
GCGGCGCTGA GTCCGCGCAA AGAACGTCGG CTATTGCGGC GGTACGGCTG A
 
Protein sequence
MPIQQRSPQR HSRAPVVAQA WRDQKRYGWL LGLVIPTLVP MSWAAAALTG AGVFWWSGPA 
LMFLVIPTLD YLVGPDADNP PDSALTWLEN DRFYRWATYL YLPAQYVSLM LACWLWSGGG
GVAMSDVDKV GLMLTIGGIG GVAINIAHEL GHQRARSERR LSKIALAQTG YGHFFVEHNR
GHHARVATPE DPASSRLGES IYTFQFRSVL GSLRSAWRLE RRRLSRHGKS PWTLRNDVLN
SWLMTAALFA VLVAGFGVEV LPWLLGQAVV GICLLESINY LEHYGLRRQR RADGTYEQVR
PSHSWNSNSV ISNVFLFHLQ RHSDHHANPH RRYQALCHAD EAPQLPSGYA TMVLLALFPP
LWRRVMDGRV LAHYGGDIRL AALSPRKERR LLRRYG