Gene Mvan_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0035 
Symbol 
ID4644816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp43303 
End bp44505 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content59% 
IMG OID639803546 
Producthypothetical protein 
Protein accessionYP_950892 
Protein GI120401063 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.89444 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAAT CTGTTCGACT GGGCGATCTC ATCTCTGTCA AGCATGGCTA TGCCTTCCCC 
GGCGAGGGGT TCACAGAAGA CCCGACGTAT CCAATCTTGG TCACACCTGG GAATTTCGCG
ATCGAAGGCG GATTCAAGGA ATCGAAACCA AAGACGTTCA ACGGCGACTA CCCACCAGGG
TTCGAACTGG CGCCGGGCGA CTTAGTGGTT TCAATGACCG ACCTTAGCCG CGACGGTGCG
ACCCTCGGTA TGCCGGCGCT GATCCCGGCT GGCCCGACCT ATTTGCACAA TCAGCGGATC
GGACTTATCG AGGCGATCGA TCGATCGAAG ATTGATCGGC TTTTCCTCAA CTATTACCTG
CGCACGGCCG CTTATAGATC CCACATTCTA GGCACTGCTT CGGGATCGAC AGTTCGTCAT
ACGAGCCCTA GCCGTATCGA GGATTTTGTT GCGCTTCTGC CTGGGCTTCT GGAGCAGCAA
GCGATAGGAG CGATTTTGGG ATCGCTCGAC GACAAAATTG GCGTGAATCG CAGACTGGCG
AATGTCGGTC GGCTCCTTCA GTCGGAACTC TGGCATCGTG CTGCAACGGG GAGCCGTCAG
GTGTCCCTAG GGTCGTTGGT GCGGCCTCAT CTTGGCGGTA CACCATCACG TTCCGATAGC
AACTTATGGG CAGGTGACGT TCCGTGGGCG TCTGTTCGCG ACATGTCTGC TGCGGACGGT
GGCGTTCTCT TAGCTACTGC CGAGACGATC AGCTCGGCCG TTTCTCAGTC AGTCGGTCGC
CTCGCTGCCC TACCAGAGCG ATCAGTTGCC CTGACTGCAA GGGGCACGGT TGGCAAAGTC
GTGACTCTGG GAGTAGCAAG CGCGATCAAC CAGTCGGCAT ACGGCTTTAT TCCGCCGGCA
GGACGGGGGG TGGCGTTACG GTGCGCACTG GAGTCGATTT CCGATGAGCT GAAGGCGCGT
GCACACGGCT CGGTATTCTC AACAATCACG ATGTCGACGC TCGAGAGCGT ACGTGTCCCG
GCGATCAACG AGACAGACTG GGACGGGGTA TGCGAGTCAC TTGAGTTGAT CGAAGATCGT
AGACTGTCAG CCCTCCGGGA GACTCGGGTG CTCGCCCGCA CGCGAGACGA ACTCCTCCCA
CTGCTCATGT CCGGCAGAAT CCGCGTCAAG GATGCCGAAG CCCGCGTGTC GGAGGTGGTG
TGA
 
Protein sequence
MRESVRLGDL ISVKHGYAFP GEGFTEDPTY PILVTPGNFA IEGGFKESKP KTFNGDYPPG 
FELAPGDLVV SMTDLSRDGA TLGMPALIPA GPTYLHNQRI GLIEAIDRSK IDRLFLNYYL
RTAAYRSHIL GTASGSTVRH TSPSRIEDFV ALLPGLLEQQ AIGAILGSLD DKIGVNRRLA
NVGRLLQSEL WHRAATGSRQ VSLGSLVRPH LGGTPSRSDS NLWAGDVPWA SVRDMSAADG
GVLLATAETI SSAVSQSVGR LAALPERSVA LTARGTVGKV VTLGVASAIN QSAYGFIPPA
GRGVALRCAL ESISDELKAR AHGSVFSTIT MSTLESVRVP AINETDWDGV CESLELIEDR
RLSALRETRV LARTRDELLP LLMSGRIRVK DAEARVSEVV