Gene Mvan_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4103 
Symbol 
ID4648713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4396021 
End bp4397382 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content66% 
IMG OID639807570 
Producthypothetical protein 
Protein accessionYP_954886 
Protein GI120405057 
COG category[R] General function prediction only 
COG ID[COG4908] Uncharacterized protein containing a NRPS condensation (elongation) domain 
TIGRFAM ID[TIGR02946] acyltransferase, WS/DGAT/MGAT 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.612164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGT TCATGCGCAA CAGCGATGCG TTCACCTGGG CCATGGAGAG TGATCCCCGG 
CTGCGCTCGA CGGTGGTCAC GCTGATCCTG CTCGACAAGT CGCCGGACTG GGACGAAGTA
CGGGATCGCT TCGACATCAT CGCCCGGGCG GTGCCGATGC TGGGACAGCG GGTCGTCGCG
TCTCCGCCTC CGGCGCCGCC GAGGTGGGAG CAGGCCCCGG ACTTCGACCT CGACTTCCAT
ATGCGCAGAG TGACCGCGTC GGCCCCCGGC ACCGTCGACA CGCTGTTGGA GATGGCGCGG
GTCGCGGCGA TGGAGGACTT CGACCGGGCC CGCCCGCTCT GGGAGGCCAC TCTGGTCGAT
GGTCTCGACA ATGACGGCGC CGCAATGATT CTCAAGTTCC ATCATGCGTT GACCGACGGC
GTGGGCGGCA TCCAGATCGC GATGATCCTC TTCGATCTGT CGGCGGCCCA TGAGCAACCC
GCGCCGGTGG CGGCACTGCC CGAGGTACCC CCGCCGCCGT GGCTGCGTAG CTATCGCGAC
ACCGCCCAAT ACGGGGCCGG TTTGGTGGGC AACGCCCTGA TGGGCGCCCT GAAGATGGCC
CCGAAGCTGG TTTCCCGCAG CATTCTGCGG CCTGTGGACA CCGTGTCGTC GGCAGCCGAG
CTGGCCGCGT CGGTCTACCG GACCATGGCG CCGGTGAACC GGAGGGGCTC GTCGCTGATG
AGCAAGCGCA GCCTCATCCG CCGGCTCGGT GTGCACGAGG TCCCGTTCGC AGCGCTGCGG
GCGGCCGCCC ATCGCGGCGG TGGTGCGTTG AACGACGCGT TCGTCGCCGG CGTGGCCGGC
GGGTTGCGCC GCTACCACGA AGCACATGAC GTGACGGTTG GAGATCTTCA CCTGACCATG
CCGATGAGTC TGCGCGAAGA GGGCGACGAG ATAGGCGGCA ACCGGATCAC CATCATGCGG
TTCGACGTAC CGGTCGGTGT CGCCGATCCG GTGGAGCGGA TTCGCCAGGT GCACGAGCGA
ACCGGCAGGG TGCGTCACGA GAAGTCGGTG CCGTACACGC AGTGGATCGC CGGTGTCCTG
AATATGATGC CGCGGTGGTA CATCGGATCG ATGCTGCGCA ATATCGACTT TTTGTGTAGC
GACGTGCCGG GGATTCCGGT GCCGGTGTTC CTCGGCGGCG CGAAGGTTCT CGCCCAATAC
GGTTTCGGCC CGACCATCGG ATCAGCGGTC AACGTGACGT TGTTGACCTA CGTGGACGTG
TGCACGCTGG GCATAGACGC GGACACCGGC GCGATTCCCG ACTTCGAGGT GTTCTTCGAC
TGCCTCGTCG CCGGCTTCGA CGAGGTGTTG GCACTGGGCT GA
 
Protein sequence
MTEFMRNSDA FTWAMESDPR LRSTVVTLIL LDKSPDWDEV RDRFDIIARA VPMLGQRVVA 
SPPPAPPRWE QAPDFDLDFH MRRVTASAPG TVDTLLEMAR VAAMEDFDRA RPLWEATLVD
GLDNDGAAMI LKFHHALTDG VGGIQIAMIL FDLSAAHEQP APVAALPEVP PPPWLRSYRD
TAQYGAGLVG NALMGALKMA PKLVSRSILR PVDTVSSAAE LAASVYRTMA PVNRRGSSLM
SKRSLIRRLG VHEVPFAALR AAAHRGGGAL NDAFVAGVAG GLRRYHEAHD VTVGDLHLTM
PMSLREEGDE IGGNRITIMR FDVPVGVADP VERIRQVHER TGRVRHEKSV PYTQWIAGVL
NMMPRWYIGS MLRNIDFLCS DVPGIPVPVF LGGAKVLAQY GFGPTIGSAV NVTLLTYVDV
CTLGIDADTG AIPDFEVFFD CLVAGFDEVL ALG