Gene Mvan_3594 details


Gene Information

Locus tag: Mvan_3594
Symbol:
ID: 4647159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3822515
End bp: 3825685
Gene Length: 3171 bp
Protein Length: 1056 aa
Translation table: 11
GC content: 69%
IMG OID: 639807068
Product: YVTN beta-propeller repeat-containing protein
Protein accession: YP_954392
Protein GI: 120404563
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01965] VCBS repeat; [TIGR02276] 40-residue YVTN family beta-propeller repeat
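
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3822515
END_BP = 3825685
GENE_LENGTH_BP = 3171
PROTEIN_LENGTH_AA = 1056


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3171
print(expected_protein_length(GENE_LENGTH_BP))  # 1056
```

Both results agree with the reported gene length (3171 bp) and protein length (1056 aa).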


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.207597
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.593254
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGCATT CGACGTACAT CGGCCGCATC GGTGCACTGG CGTTTGCTCT GGGCGTGGGT 
GTGGGGCTCG GCGCCACACC CGCCGTCGCA TTTGCGGACG AGACCGGCAC CTCGGCGAGC
TCCCCCGCCG GTGAGAACAC GACATCCGGT CCGGGCACCG CAGCGGACGC CCCGACCGAG
ACCGACGTCG ACGAACAGGA CGAAGCAGCT GTCGACGAAC CGGAGCAGCC CGCCGAGGAG
AACGACGCAG ACGCCGTCGA AACCGTCCGG GACCGCCAGG ACCGGATCGT CGACACCGAC
CCGGAACCCG TGGCCGCATC CGACGATGCC CCCGAAACCG AACCGGAACC CGAGCCAGCT
GTCGACGAGG CCGTCACCGC GCCCGAAGAC GCACCGACCG GCGATCCCGC GCCCGACGTC
GCACCCCCCA CAGGCCCCGA AGCCGACACC GTGGCCGAGA TCCCGTCTCC TGGCGCGGAA
CCGGTAGAGG CCCCCGCCTC CACGTCAGTC ACACTCTCCA CGATCCTGTC CTCGCTGCTC
GCACCGCCTC GCACCCCGGA CGCGCCCTCC GAGTCACCGC TGTGGCTGGT GGTGGCCGCC
GCGCTGCGCC GCCAGCTGGA CTCCCCCACG ACATCCGGGG AAGCCGGCCT CAGTTCGACG
CTGCTCACCC CGGAAGAGGC CGAGGCGATC CGCCGGCTGG GCGACATCAC GACCGGGCCC
AACCCGACAG ACGTGGTCTC CACCGACACG CGCGCGTATG TCGCTCATCC GGGCAGCAAG
TCGATAACGG TGATCGACAC GGTGAACGGC ACTGTGCTGC AGACCATTTC ACTGTGGTCC
ACCCCGACGA AGCTGGCCCT GAGCCGCGGA GGTGGCCGCC TCTACATCAG CAACGCCCTG
GCAGGCACGG TCTCGGTGAT CAGCACAGCC ACCAACTCGG TCGTCAAGAC CATCCGCGTC
GGCGACACCC CGACCGGAAT CGCGGTGAGC CCGGGCGGCA CCCGGGTGTA CGTCGTCAAC
AGCGACGACG GCACGGTGTC GAAGATCAGC ACGCTGACCA ACAAGGTCGT CGGGACCGTT
TACGGTGTCG GCAAAGGTGT TTCGACGATC ACGGTCAGTC CCGACGGCGC GACGATCTAC
GCCCTCAACG GAACCACCGG CGAGATCTCC CACTTCTCAG CCGCCTCGCT GTTCGCCGGG
AAGATCACCG GCGTCACGCC GGGATCCCAG GGCATCACGT TCAGCACCGA CGGTTCCCGC
GTGTTCGTCG CCGACCTTGC CGGCAGCGTG AAGGTCATCG ACACCGACTC CCACGAGGTC
GTCGACAGTA TCATCCTCGC GACAGGTGCG CCGTTCGACC TGGCGGTCAG CCCGGACGGG
ACGACGCTGT TCGTGGCACG CAGCGGCGAC GGCAAGCTGT CGGTGTACGA CATCGCCACC
AAGACCGAAC TCACCTCGAT CGTCGCCAAC CCCTACCTGG TCGACGGTGC GCCGACGATC
AGCGCGAGTC CAGACGGCAC GCAGCTGTAC TGGACCGACT CGGGCAGTGA CCGGGTGCAT
GTGATCGCGC TTCTCGCCCC CAACGGCGAT CCGGTCGCCG GAACACCGGT CGTGAACGCC
CCCAACGCTT CCGGGGCGGT CACCGGATCG GTCACCGTTA CCGATCCGGA AGGCCGGCCG
CTGACCTACA CGGTCAGCAC CCCCGACAAG GGCGCCGTAA CCGTGTCCCG GGGTGAGAAC
GGCGACTTCG TCTTCACCTA CACCCCCACC GCCGCCGCAC GCCACGCCGC GGCCGCCGTG
AGCGGCGCGC CCGGCGCCGC CGTCGACACG TTCACCGTCA CGTTCTCCGA CGGCCGGCGC
GGGGTGGTCT CGGTCCCGAT CACCGTCACG ATCGCGCCTG CCAACGCGGC CCCGACCGCC
ACGGCCCGAT CGAGCCTGTC CTGGTTCAGC CCCAAGGTGT ACGGCACCGT CACCGCCAAG
GACGCCGACA AGGACACGCT GGGCTTCACC GCGTCACCCA CCGCCAAGGG CGGGACGGTC
ACGATCGACG CCGACGGCAA GTTCACCTAC ACGCCCACCG CGGCAGCCAG GCACGCCGCC
GCGAACGCCG GTGCCACCGA AGCCGATAAA CACGACACCT TCACCGTTCT CGTCGACGAC
GGACACGGCG GGGTCACACC GCTCACGGTG ACCGTGAAGG TCAAGCCCGG CAACGCCACC
CCGACAGCCA CCGTCCGAAC CAGTTCGTCA TGGTTCAGCG CCAAGGTGTA CGGCACCGTC
ACCGCCAAGG ACCTCGACAG GGACGGCCTG GCCTACACCG CGTCCGCCAC CGCCAAGGGC
GGGACGGTCA CGATCGACGG CAGGGGCCGG TTCACCTACA CCCCCAGCGA TGCCGCCCGC
CACGTCGCCG CGGCGGAGGC AGCGACGCGC GAGGACAAGC AGGACACGTT CGACGTCGTC
ATCGACGACG GGCACGGCGG CGTCGCCACC GTGGCCGTCA CCGTGAACAT CAAGCCCGCG
AACGCGGCGC CGACGGACGC GGCCACGGCG GACGTGTTCA CCAACCCGAA CAGCGGCGTC
ATCACAGGGA AGATCACCGC GGTCGACGCG GACGGCGACG CCGTCACCTA CCGGGCCGCT
CCGGCCACCA GCAAGGGAGT CGTGAGCATC GACGCCGACG GGACGTTCAC CTACGTCCCG
ACGGCGGAGG CCCGGACCGC CGCGAGCAGG CCGTTCGCGC CGTCCTGGGA CAAGAACGAC
CGGTTCCGGG TGACGGTCGA CGACGGCCAC GGCGGCACCA CCACGCTGAC CGTCCGGGTC
GCCATCGCGC CGCTCGGGCA TGTCAACCAG GCCCCCACCA ACGGCAGTTA CTCTGTAGGA
CAACCGAATC CGGCCAGCGG GAAGGTCGCC GGCACGGTGT CGGCGACCGA CCCCGAGCGC
GATGCCGTGA CCTTCGCAGG CTCCGGTGTC ACCGGCAACG GCAGTGTGAT CGTCGACGCC
GCAGGTGGTT TCGTCTATAC ACCGAGCGAC GAGGCACGCC ACCGGGCCGG CGCGGACGAC
GCCACGCAGG CCGACAAGCA GGACACGTTC ACCGTCACCG CCGTCGACTT CTACGGCGCG
AAGCTGGCCG TCCCGGTCAC CGTGACCATC GTCGGGTTGA CTGCTCTCTA G
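
The gene sequence is printed in blocks of 10 bases, so whitespace has to be removed before any length or composition check. The following is a minimal sketch of how a GC fraction could be computed from such text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as a small example.
first_line = "GTGGGGCATT CGACGTACAT CGGCCGCATC GGTGCACTGG CGTTTGCTCT GGGCGTGGGT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Running the same functions on the complete sequence gives a value that can be compared with the GC content reported in the Gene Information table.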
 
Protein sequence
MGHSTYIGRI GALAFALGVG VGLGATPAVA FADETGTSAS SPAGENTTSG PGTAADAPTE 
TDVDEQDEAA VDEPEQPAEE NDADAVETVR DRQDRIVDTD PEPVAASDDA PETEPEPEPA
VDEAVTAPED APTGDPAPDV APPTGPEADT VAEIPSPGAE PVEAPASTSV TLSTILSSLL
APPRTPDAPS ESPLWLVVAA ALRRQLDSPT TSGEAGLSST LLTPEEAEAI RRLGDITTGP
NPTDVVSTDT RAYVAHPGSK SITVIDTVNG TVLQTISLWS TPTKLALSRG GGRLYISNAL
AGTVSVISTA TNSVVKTIRV GDTPTGIAVS PGGTRVYVVN SDDGTVSKIS TLTNKVVGTV
YGVGKGVSTI TVSPDGATIY ALNGTTGEIS HFSAASLFAG KITGVTPGSQ GITFSTDGSR
VFVADLAGSV KVIDTDSHEV VDSIILATGA PFDLAVSPDG TTLFVARSGD GKLSVYDIAT
KTELTSIVAN PYLVDGAPTI SASPDGTQLY WTDSGSDRVH VIALLAPNGD PVAGTPVVNA
PNASGAVTGS VTVTDPEGRP LTYTVSTPDK GAVTVSRGEN GDFVFTYTPT AAARHAAAAV
SGAPGAAVDT FTVTFSDGRR GVVSVPITVT IAPANAAPTA TARSSLSWFS PKVYGTVTAK
DADKDTLGFT ASPTAKGGTV TIDADGKFTY TPTAAARHAA ANAGATEADK HDTFTVLVDD
GHGGVTPLTV TVKVKPGNAT PTATVRTSSS WFSAKVYGTV TAKDLDRDGL AYTASATAKG
GTVTIDGRGR FTYTPSDAAR HVAAAEAATR EDKQDTFDVV IDDGHGGVAT VAVTVNIKPA
NAAPTDAATA DVFTNPNSGV ITGKITAVDA DGDAVTYRAA PATSKGVVSI DADGTFTYVP
TAEARTAASR PFAPSWDKND RFRVTVDDGH GGTTTLTVRV AIAPLGHVNQ APTNGSYSVG
QPNPASGKVA GTVSATDPER DAVTFAGSGV TGNGSVIVDA AGGFVYTPSD EARHRAGADD
ATQADKQDTF TVTAVDFYGA KLAVPVTVTI VGLTAL
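
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG can act as an alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first ten codons only. The codon dictionary is deliberately partial, `START_CODONS` lists only a subset of the table 11 start codons, and `translate_prefix` is a hypothetical helper rather than a library function.

```python
# Partial codon table: only the codons needed for this example.
CODONS = {
    "GTG": "V", "GGG": "G", "CAT": "H", "TCG": "S", "ACG": "T",
    "TAC": "Y", "ATC": "I", "GGC": "G", "CGC": "R",
}

# Subset of the start codons allowed by translation table 11 (assumption:
# this subset is enough for the example; the full table lists more).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str, n_codons: int) -> str:
    """Translate the first n_codons, reading a start codon at position 0 as M."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, 3 * n_codons, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator methionine
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


print(translate_prefix("GTGGGGCATT CGACGTACAT CGGCCGCATC", 10))  # MGHSTYIGRI
```

The output matches the first ten residues of the protein sequence listed above.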