Gene Mvan_3338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3338 
Symbol 
ID4644384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3553087 
End bp3554274 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content73% 
IMG OID639806815 
Producthypothetical protein 
Protein accessionYP_954141 
Protein GI120404312 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.33185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.571531 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTCGA TCACCATTGA CGAACGGCGC GCCCGGCTGG CCAGGCGCCA CCACCTCGCC 
CCCGGCCCGG ACGGTGCCCC GTCGGTGCAC GCGGTGACCG GCAGGCTGGT GGGGCTGCAC
GCGACGGATC CGGCCACCCC GCACCTGTCG CTGTGGGCGC GGCTGCCCGG ATACACGGTG
GCCGATCTGA ACGCCGCTCT CTACGAACGG CGTTCGGTCG TCAAGCAGCT GGCGATGCGG
CGCACGCTGT GGGTGATACG TGCCGAGGAT CTTCCCGCCG TGCAATCGGC CGCCGGCGAC
CGGGTGGCCA CCAACGAGAC CCGCCGACTG GCGGCCGACG CCCAGAACGC AGGGGTGGCG
CGGGACGGAC ACGCGTGGCT GGAGGCGGCG TGCGCGGCGG TGGTGCGCCA TCTGCGTGGC
GCAGGGCCGT GCACGGCACG CGAACTTCGA GAGGCGCTGC CCGAGCTGAC CGGCACCTAC
GACCCCGCTC CGGGGAAGCC CTACGGCGGT GAAGGGCACC TGGCGCCCCG GGTGCTGACG
GTGTTGTCAG CCCGCGGTTT GATCGTCCGC GGCCCCAACG ACGGTGGCTG GACCACATCG
CGTCCGCGCT GGGCCGCGGC GGGGTCGTGG CTGGGCCCCG CTGACCCCGT CTCAACGGAT
CAGGCCCGCG CAGAGTTGGT ACGGCGCTGG CTCCACGCGT TCGGGCCCGC CACCGTCGAC
GACCTCAAAT GGTGGTTCGG CGCCACCCTC GGCTGGGCCC GGCAGGCACT GTCCGATGTC
GACGCGGTCG AGGTTGCCGT GGAAGGTTCA GCCGGTACGT CAGGCTTCGT GCTTCCGGGC
GACGACGGTC CGGAGCCCGA TGTCGAACCG TGGTGTGCAC TGCTTCCCGG CCTGGACGTC
ACCACGATGG GCTGGGCGGG ACGCGACTGG TATCTGGGAC CGCACCGCGG CGCGGTGTTC
GACCGCAACG GCAACGCGGG CCCCACGGTC TGGGTGGACG GTCGCGTCGT CGGCGCATGG
CGTCAGGACG ACGACGGGCG GGTCGAGCTG ATGCTGCTCG AAGACGTGGG ACGCCCGGCG
CTCAGGGCGC TGACCGCGCG CGCCGAGGAG CTGACGGCGT GGCTCGGCGG TGTGCAGGTG
AAGCCGCGGT TCCCTTCCCC CGCGAGCAAG TCCGCGGCTA GGCGCTGA
 
Protein sequence
MRSITIDERR ARLARRHHLA PGPDGAPSVH AVTGRLVGLH ATDPATPHLS LWARLPGYTV 
ADLNAALYER RSVVKQLAMR RTLWVIRAED LPAVQSAAGD RVATNETRRL AADAQNAGVA
RDGHAWLEAA CAAVVRHLRG AGPCTARELR EALPELTGTY DPAPGKPYGG EGHLAPRVLT
VLSARGLIVR GPNDGGWTTS RPRWAAAGSW LGPADPVSTD QARAELVRRW LHAFGPATVD
DLKWWFGATL GWARQALSDV DAVEVAVEGS AGTSGFVLPG DDGPEPDVEP WCALLPGLDV
TTMGWAGRDW YLGPHRGAVF DRNGNAGPTV WVDGRVVGAW RQDDDGRVEL MLLEDVGRPA
LRALTARAEE LTAWLGGVQV KPRFPSPASK SAARR