Gene Mvan_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0421 
Symbol 
ID4647796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp458056 
End bp459417 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content73% 
IMG OID639803929 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_951275 
Protein GI120401446 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.659974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTGCAC ACCCGTGGGC GCTGGCAGCC GCGGTCGCGC TGCTGGCGGC CAGTGCGCCG 
CCGGCCGGCG CCATCACCCC GCCCGAGGTG GACCCTGCCG TGCCCCCTCC GGGCGGCAGC
GTGGGGCCGG TCGGGGCGAT GGCGCAACGC AATCCGTGTG TCATCAGTAC CGCGCTCCCG
GGCACCGACC CCGGCGCCGC AACCCCGGAT CGGACTGCGC TGCGCCTGTC CGAGGCCTGG
ACCCACAGCC GAGGTGAGGG GCAGACCGTC GCGGTACTCG ACACCGGGGT CAAGCCCGGA
CCGCGGCTCC CCGACGTCGA GGCCGGAGGT GACTACGTCG CCTCCGGTGA CGGCCTGACC
GACTGCGACG GGCAGGGCAC GCTCGTCGCG GGACTGATCG CCGGACAGCC CGGCGCGGAC
GGTTTCTCCG GAGTCGCGCC CGCCGCCCGC ATCCTGTCGA TCCGGGTGTC CTCGCCGCGG
TACGCGCCGC GGGATGCCGG CGAGGACCCG GCCGTCACGC GCGCGATGCT CGAGACGGAG
GCGATGGCCG GCGCGATCGT GCGCGCGGCG GACCTTGGTG CCCGCGTCAT CAACATCTCC
GCCGTGACGT GTGTGCCTGT CGGCGAGAAC TTCGACCAGA GCGGCCTCGG TGCGGCGCTG
CGGTACGCCG CGGTCGACAG GGACGTCGTG ATCGTCGCCG CCGCGGGCGA GGGCGGCGCT
GCCGGTGGTT GCGATTCCAA CCCGCTGTCC GACCCGGCGC TGCCGTCGGA TCCGCGGAAC
TGGTCGGGGG TCACCGCCGT GGCGATCCCG GCGTGGTGGC AGCAGTATGT GCTGTCGGTC
GGGTCGCTCG CTCCCGACGG CACGCCATCG TCGTTCACAA TGGCCGGGCC GTGGGTCGGC
ATCGCCGCGC CAGGCGAGGA CATCACTTCG GTGAGCAACG ACGAGGCCGG TGGGCTCGCC
AACGGCCTGC CCGGCGACCG GGACCGGATC GACCCGGTCC GTGGCACCGG CTATGCGACG
GCGTACGTCT CCGGCGTTGC GGCGTTGGTG CGCAGCAGGT TTCCCGACCT GACCGCGCGG
CAGGTGATCG AGCGTCTCAC CGGCACCGCG CAGTCGGCGG CCAGATCCCC GTCGAACCTG
GTCGGCGCAG GACGCATCGA CCCGGTCGCG GCGCTGACCT GGAATGTGCC CGCCACCGAA
GAGATCGGTA CCACGGCGGC CAGGCCGGTC GCCGCTCCGG CACCACCGCC GCCGAAGGAT
CCGGTCCCGC GCGCAGTCGC GTTCGCCGGT GCAGGAGTGT TGGCGCTCGT CGTGCTCACC
GTGTCCCTGA TGAGCACGAG AAGGAAGGAG ACGTCGTCAT GA
 
Protein sequence
MRAHPWALAA AVALLAASAP PAGAITPPEV DPAVPPPGGS VGPVGAMAQR NPCVISTALP 
GTDPGAATPD RTALRLSEAW THSRGEGQTV AVLDTGVKPG PRLPDVEAGG DYVASGDGLT
DCDGQGTLVA GLIAGQPGAD GFSGVAPAAR ILSIRVSSPR YAPRDAGEDP AVTRAMLETE
AMAGAIVRAA DLGARVINIS AVTCVPVGEN FDQSGLGAAL RYAAVDRDVV IVAAAGEGGA
AGGCDSNPLS DPALPSDPRN WSGVTAVAIP AWWQQYVLSV GSLAPDGTPS SFTMAGPWVG
IAAPGEDITS VSNDEAGGLA NGLPGDRDRI DPVRGTGYAT AYVSGVAALV RSRFPDLTAR
QVIERLTGTA QSAARSPSNL VGAGRIDPVA ALTWNVPATE EIGTTAARPV AAPAPPPPKD
PVPRAVAFAG AGVLALVVLT VSLMSTRRKE TSS