Gene Mvan_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0040 
Symbol 
ID4644894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp51521 
End bp52831 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content69% 
IMG OID639803551 
Producthypothetical protein 
Protein accessionYP_950897 
Protein GI120401068 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.480541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCGA CCGTGGTTGA TTCGCCTGCT GGGTGCACTC AGTCGGAGCG GCTTGAGGTG 
TTGTTCGAGG AGCTTTCGGA GTTGGCGGGG CAGCGCAATG CCATTGACGG GCGGATCGTG
GAGATCGCCG CGCAGATCGA CCGTGACGGG CTTGTCGGGA TCACCGGGGC GCGGTCGGTG
GCGGCGTTGG TCGCGTGGAA GACCGGGTGC TCGCCGCACA ACGCCAAGAC GATCACCACG
GTCGCCGACC GGCTCGAGGA GTTCCCGCGC TGCGTGACGG CCTTGGGTGA GGGCCGTCTT
TCCCTCGACC AGGTCGCGGT GATCGCCGAA CACGCCGGCC AGGGCTCTGA TGCCCATTAC
GCGCACCTGG CGGCGAGCGC TTCGGTCAGC CAGTTGCGCA CCGCGGTCAA ATCCGAACCC
CGACCCGATC CCGAACCGGT GCGCGATCCG TTCGCCGACG CGGACGAGGA ACCCGAACCC
GCTCCGGTGC CCAGGCCGGA GATCACCAGC ACCTGCGACG CCGACTACAC CTACTGGCGG
ATCAAGCTGC CGCATGAGCA GTCCGCGAGA TTCACCGCGG CCCTGCAGTC GCATAAAGAC
CGGCTGATCG CCCAACACAC TCGCGACCAC GGCACCGACC CCACCGGCGA CGGTGACGGT
GACCGTGGGG TGCAGCTGCC GCCGTGGCCG AGCGCCGGTG AGGCGTTCAT GGAGCTTGTC
GAGGCCGGCT GGGATGCCGA AGCCACCCGC CGCCCGCACG GTCAGCACAC CACCGTGGTC
GTGCACGTCG ATATCGACAC GCGGGTCGCC GCCCTGCATC TGGGTCCGCT GCTCACCGAC
GAGGAACGTC GCTTTCTGCT CTGCGATGCC ACCTGTGAGG TCTGGTTCCA ACGCCACGGC
CGGCCCCTCG GCACGGGACG GTCCACCCGC ACGATCAACC GCCGCCTGCG CCGTGCCCTC
GAGCACCGCG ACCGCACCTG CGTGGTCCCC GGCTGCGGCG CGACCCGCGG CCTGCACGCC
CATCACCTCG TGCACTGGGA AGACGGCGGC GACACCGAAC TCGACAACCT GGTCCTGGTC
TGTCCCTACC ACCACCGAAC CCACCACCGC GGCCTGATCA CCATCACCGG ACCCGCCCAC
CAACTGCTCG TGACCGACCA CACCGGCCGA CCACTGCAAC CGGGGTCACT GGCGCGGCCC
CCGACCACAC CACCACCAGA GGTCAACCCC TACCCCGGAA CCTCGGGAGA ACGCGCCCAA
TGGAAGTGGT ACCACCCCTA CCAACCCCCA CCACCAACGA GCAACAACTA G
 
Protein sequence
MSSTVVDSPA GCTQSERLEV LFEELSELAG QRNAIDGRIV EIAAQIDRDG LVGITGARSV 
AALVAWKTGC SPHNAKTITT VADRLEEFPR CVTALGEGRL SLDQVAVIAE HAGQGSDAHY
AHLAASASVS QLRTAVKSEP RPDPEPVRDP FADADEEPEP APVPRPEITS TCDADYTYWR
IKLPHEQSAR FTAALQSHKD RLIAQHTRDH GTDPTGDGDG DRGVQLPPWP SAGEAFMELV
EAGWDAEATR RPHGQHTTVV VHVDIDTRVA ALHLGPLLTD EERRFLLCDA TCEVWFQRHG
RPLGTGRSTR TINRRLRRAL EHRDRTCVVP GCGATRGLHA HHLVHWEDGG DTELDNLVLV
CPYHHRTHHR GLITITGPAH QLLVTDHTGR PLQPGSLARP PTTPPPEVNP YPGTSGERAQ
WKWYHPYQPP PPTSNN