Gene Mvan_1088 details


Gene Information

Locus tag: Mvan_1088
Symbol:
ID: 4648498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1148572
End bp: 1153473
Gene Length: 4902 bp
Protein Length: 1633 aa
Translation table: 11
GC content: 61%
IMG OID: 639804588
Product: Type IV secretory pathway VirB4 components-like
Protein accession: YP_951931
Protein GI: 120402102
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3451] Type IV secretory pathway, VirB4 components
TIGRFAM ID:
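
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1148572
end_bp = 1153473
gene_length_bp = 4902
protein_length_aa = 1633


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under those assumptions both checks hold for this record: the coordinates span 4902 bp, and 4902 / 3 = 1634 codons, which leaves 1633 residues once the stop codon is excluded.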


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTTTGC GCTTAGGTGC CGCCCTCGCG GAGTACTTGC GGCTTAATCT GAAGCCCGCC 
GAGCGCAATT TCTTTGTGTT GGTCGAGGGC GTCTCGTCAC ATGTTGCTGC CGGCATGGCA
AGCGAGTGGG ACGATGCCCT CCCGCGGCTG GCTGTCGCAG CGCCGGAACC ATCTCGATTC
GGCGCGTACG CATTGACGGA CGTGTCAGGC ACCCAGTTGA GAAATGCCGC TGGCTCAAAT
GGAGTTGTTC TCGTCCTATG CGATGGGGAA CAGGTTCCGG ACCGCCAAGG GATTAGCCTC
TTCGATTCAA TTTTCCCGAG CATCCTTCTT GACAAGCCGC AAGGCCTGAT CCTGCTCTGC
CAGCAGAAAC CAGTTGTTGA CCCCGACGGT CCCGTCAGGG CTGTTCGCGA TGCAATTGTT
CAGGCAGACA TTGCTATCCG TCCGAGCCCG AAGGCGGTCG CCGACTTCCT CGATGTTGTT
GCAGCGGGCA ACAGCCCTCT GGAAGCACTG CCGACTCTTG GTGCTTTCAC CGACAGCGTC
GCACGAGGAG ACCGTGCTGA CGCTGGGCGG ATTCTCGACA ATCTGCGTTT GGCTTCGAAG
AGGACAAGCG ACGATTTCCT TCGCCCAAGT GCATACGCAG ACTTCCGTAA ACGTGCTGAA
CGGGTTCTCG CAACGCGTCC CTCGTTGCGA GGCAAGAAGG CCGAAATCCA TGCGGCCGCC
GACTCAATAA TGAGCAGCCT GCAATCGGGA AGCTCAGACC TTCTCCGGGA ACTGACTTTC
CATGAGGCTC GCGAGATCTT CGAAAAACGA AGTGAATCGC TCACGGAAAC AGTGCTGCGC
GAGATGGCGA ACTACAGAGC AGCGCTGGAT CCAGAGAGTT ACGCGGTCGG GCTTCCTTGG
GATTCTTACG AAAGCTGCGC CCACAACCTC GGGCGTGGAG CCGACCAGCG TGCCGCCGCG
CAAGAGCTCT GTGATCTCGA CGACGGCCAA CAGAAGCAGC TGTTTCACAG GACTACTCGG
ACGAAACTCG AGCGGCTGCT CAGGGACAAG TCAGTCAATG GGAGCAAGCC TTCATGCCCT
GAGGCCGCAC TGGTGCGCAG CACCCAACAG CTGGGCACAC CGATCGCTCG GGTGCAACTC
CTCGCGCCTG CAGCACCTGC GCTCAACACC GCCTCGGCAA CCAACCGATC CGGCGCGGGA
CGAATCCTGA CCCTGGCCTG CGCCCGACTG CGACTCGGTG GCCTCATGAG ACGCTGGGAT
AGCCTCGGCG TAGAAGTCGA CGGTCTCTTG CTGAAGGCCG CCGACGACGA AGAAGACCTT
GGCGACGTGC TGGGCGCTTT CAGTGATGCA GGCCTTGCCG ATGGCACCCC ACTTGCGCTG
CTGCAGTTGA GGATCCACGA TGCGGACGGC ACGACAGTTC AGATTGACTG GCGGCCTGAT
CTTGATGATG CAGCGCTCCT CCGCGCAGTT CTCCTGTTCG CTTCCGATGC ACCCAGTCTC
ACGCTAGCGA CGTTAACTGA GCCGACGCTT ACCGAATTCT GCGGCCACGA ACAACCAGAG
CCGATCCACC CCGTTCCGGC AGCGCTCATG CCCCTCGCCC AGACTCTGCA TACAACGGCC
AAAACAGCGT TGGAACGAGG CCTCACGCCA GAGCTGTTAT CCAACTGGTC ACATGCTTGG
ACCGCTACGG TCAACGAACA ACACGAGAAA GATGCAGATA GCGCGATCGC GGAGACCCTG
ACGCTGGCCG GCGGAGTCCT CCTCAAAGGT GAATGCGCCG CGCTCACCGG GCTCGCTCCC
CTCAAGGCTG AGTGGCTTTC GCAACAGCTG ACCGCTCTGT GGGATTTGCT CCTCAATTCA
GCGGGGACTC CAGGAAAATC CGAAGTGCCC GATGCGGTTT CGGCCTCAAC CGGAATCGCC
AGCGCCACTG CGGCACATCA TCCCGCCCAC CTGAGGTTGC GAAATCAGGA CCAGCCTTTC
CTGCCCTCAA GCGAGGGACG AATCTGGAGT CTTTACGGCG GCAGCGCCAC GCGAGACCAG
AGCGGCTTCG CCGACGAGGC ACTCCGTTCA GTTATTACGC AGCTGTTGAC ACTTCAACCG
GAGACAGCCG GACACCTGCG TTGTGTGACT TGGGGCCCCG GGGCCGCTGA CCTAATGATT
GCCGAAGCGA CACGAATGAT TGGCGCCAAA GTCGGCCGAG CCGAGGTCAA GAAGGTCGAA
ATTTTCTGTG TCGGCGTTAC CGAAGAGTGC AGGCCCCAGT GGGCCACGCT AGCCGAAGCC
GACAAAAAGT TTCGCGCCGA ACGAGACGTG CTGCAGATTC GCTATATCGA CGACCTCCCT
ACCCTCAAGC GGATACTTCG CCCAGCAGAT GAGAGCCCGG CAGTCCATCT CGCGCTTGTT
ACGGGACTGA CTGAAGGGGG TAACCGGCCC CAACTCGAGA CCCCGGAGGT TCTGCCTCCC
GCCGAAGATC CGGACATTCT TTTCACACCT CGTGTCTGGC AACGTCCGAA GCAAGACAGG
CGCACACTCC TCATGCCGCC GACTGCATCG TCAAGCGGTC AGGCATGGCT TCGCCTACAA
AACGCCGTGG ACGAGACATG GCCTGACATG CAGGTCGAAC TTCGAGTTCC TGAAGTTCGA
ACCGGCACAG GCGCTATCCG AATCCAGCTC GAGCAGATCC ACGAAATAGC TTTGTGGGTG
GCCACACTGG ACCGTTATGC GACTCGCGAC AGCCTCGAGC AAGCACTCGG ACCCGGAAAC
GTCGCGATCC TCCACCAAGA GCGCCGACTC GGCGGGGACA GTCCGCTTTC CCTAGTGCTC
AGCCAGAAGG CCGGCGGCCC CGTCGACCGA GCGATAGGGC GCAGCCTCCG AGCCGCCGGA
ATCGTCGATA ACTCCGACAT CGCGCTGTCG ATCGGAACCG ATCTTCGAAA AGTTGCAAGT
CAGGGCTACG GAATCCTCGC CCTCCAGGCC GCCACCAGCG GCGCGGGGAT CAACGAACTC
GTCGGCCACG TTGTCGCGTT CAGCCTCCTT GCAACCACCG CTACGCCCTG GCCCCTCCCC
GCAGGATGCA GAGTCCTTCT CGTCAGCCTG GATGAGTACA GACATTGGTT CCCAACCAAA
AGAGCAGACC TGCTCGCAAT CGCCTTGGAC CCCCGTGAAG GTGGAGTCCA CGTCGCTGCC
ATAGAAGTAA AGGCGAGACG CAGCGACGAA GCTGATGCCG CGGCCGGCGC CCTCGATCAG
TTGATACAGA CTCTCTCCGC AACCCGATTC GCCGCATACC CAGAACCCGA CAGCATCAAC
AGCCGCCTTT GGCTCAATCG GATAACTGAA GCCGCCTACG CCGTCGCACG AGAGTCGCGA
TTCCGACTTG ACGCCGACGA GTTGGCAGCA CTCGAGGCCT TCCGACGTGG GAGAGGGACT
CTCGAATGGG CAGGCGTCGG TCTGGTGTTC GGGCCCAATG TCAAACCGCT GCAACGGATC
CAACAAAACC CTGTTGGCAA TGACATCGTG CCGATTGCCC TTCACAGCCT GAAGCTGACC
GAGGAGCTCC TCAGGGACGC CACGGCAACC GATCTGACCA AGCTCCGTAC CGTCGAGACG
GATCGCGCAC CGCTGGAAGG CACCCGACGG AGACGCCGTC CCGAAACCAA GCCTCCCGGT
GGAGATGAAC CTCCCAGAGG CGATTCTGGC GAAGACGACG AGCCGGACGG AGATGGGCCA
AACGAACAGG ATCCGACGCC CCCTCCGCGC CCGGAGCCAG AGGATGGCCC CAAGAAGCCC
GAGAGTGGCG ACGACCGCAT AGTCGTCACG CCACCATCGG CGCCACGGCC ATTCGTAGCA
CCGGTTTTGG GATGGGACGC CGTTAGCGAG GAAGAGATCC GCTGGCACCC CGCTGGAGCT
GGACAAGACG TGCTACAGAA CGGCCATGTC GAGATCTGGG GATCTTCAGG CATGGGTAAG
ACCCAGTTTG TGATGACGCT ACTTGCCCAG CTGTCGAGGC ACTCCGGTAC GCACTTCGGG
ATCGCTGACT TCAAGAACGA CTACAGCGAT GCCAACGGCT TCCCGGAATT CGCCGACGCG
GAGTTCCTCG ACCTGTGGGA GGAGGGTGCG CCGTACAACC CGTTGGCGCT CACCAATGAC
AGCGAGAGAA GCATCGAAAC GGCGGTAATT GAGCTTCGCG ACACCATCGA GGAAGCCACC
CGGTCTTTCA CTCGCATGGG TGTGCGTCAG AAAGCGAAGC TCAACAAAGC TTTAGCAGCT
GCTTACGCGA CGGGGAGAAG CGAAGGTCGT TGGCCGACGC TACGGACGCT AGACGAACAG
CTCGACGATG ATCTCGCCGG CGTAATGGGG GATCTGACGC GCCACCGCTT GTTCAAGGAA
GGGCCTCCGC TCGGTGACGT GATCGACCGC AACGTAGTAT TCGGGTTGTC GAAAATCCCC
GGAAATGGAC AGACGACGAT CTTGGCGGCC GGCTTCATTC TTTCGGCGCT TTTGTTGAGG
ATCCAGAATC TTCCGCCCGT ACCCAACACA ATTCGGTACG TGGGTGTCAT CGACGAGGCA
CACCGCGTCG CTGACTTCAA AGCAGTCCAG ACCATGATTC GCGAGGGCCG CTCAAAGGGC
CTCGCCGTCG TACTCGCCAC GCAGCAGCCC CTGGACCTCC AAGAAGTTAC GGGCGCGAAC
GCTCAGACGC GGATCTGTTT CGGCCTGCCT GACGCGACCT ACGCGACGAT GGCGGCGCGA
AAGATGCAGC CCGATAATCA TCGGCTCGCT GAACAAATCC GCACCCTCGG CGTCGGAGAG
GCTTATCTGA GCCTGCGCGG GTCTGCCCCG CGACTCGTAA GAATGGTTCA AGCCTACCGA
GACGCCGAGC GGTTGGGACT ACCACCACTG CGGCACATCT AG
 
Protein sequence
MSLRLGAALA EYLRLNLKPA ERNFFVLVEG VSSHVAAGMA SEWDDALPRL AVAAPEPSRF 
GAYALTDVSG TQLRNAAGSN GVVLVLCDGE QVPDRQGISL FDSIFPSILL DKPQGLILLC
QQKPVVDPDG PVRAVRDAIV QADIAIRPSP KAVADFLDVV AAGNSPLEAL PTLGAFTDSV
ARGDRADAGR ILDNLRLASK RTSDDFLRPS AYADFRKRAE RVLATRPSLR GKKAEIHAAA
DSIMSSLQSG SSDLLRELTF HEAREIFEKR SESLTETVLR EMANYRAALD PESYAVGLPW
DSYESCAHNL GRGADQRAAA QELCDLDDGQ QKQLFHRTTR TKLERLLRDK SVNGSKPSCP
EAALVRSTQQ LGTPIARVQL LAPAAPALNT ASATNRSGAG RILTLACARL RLGGLMRRWD
SLGVEVDGLL LKAADDEEDL GDVLGAFSDA GLADGTPLAL LQLRIHDADG TTVQIDWRPD
LDDAALLRAV LLFASDAPSL TLATLTEPTL TEFCGHEQPE PIHPVPAALM PLAQTLHTTA
KTALERGLTP ELLSNWSHAW TATVNEQHEK DADSAIAETL TLAGGVLLKG ECAALTGLAP
LKAEWLSQQL TALWDLLLNS AGTPGKSEVP DAVSASTGIA SATAAHHPAH LRLRNQDQPF
LPSSEGRIWS LYGGSATRDQ SGFADEALRS VITQLLTLQP ETAGHLRCVT WGPGAADLMI
AEATRMIGAK VGRAEVKKVE IFCVGVTEEC RPQWATLAEA DKKFRAERDV LQIRYIDDLP
TLKRILRPAD ESPAVHLALV TGLTEGGNRP QLETPEVLPP AEDPDILFTP RVWQRPKQDR
RTLLMPPTAS SSGQAWLRLQ NAVDETWPDM QVELRVPEVR TGTGAIRIQL EQIHEIALWV
ATLDRYATRD SLEQALGPGN VAILHQERRL GGDSPLSLVL SQKAGGPVDR AIGRSLRAAG
IVDNSDIALS IGTDLRKVAS QGYGILALQA ATSGAGINEL VGHVVAFSLL ATTATPWPLP
AGCRVLLVSL DEYRHWFPTK RADLLAIALD PREGGVHVAA IEVKARRSDE ADAAAGALDQ
LIQTLSATRF AAYPEPDSIN SRLWLNRITE AAYAVARESR FRLDADELAA LEAFRRGRGT
LEWAGVGLVF GPNVKPLQRI QQNPVGNDIV PIALHSLKLT EELLRDATAT DLTKLRTVET
DRAPLEGTRR RRRPETKPPG GDEPPRGDSG EDDEPDGDGP NEQDPTPPPR PEPEDGPKKP
ESGDDRIVVT PPSAPRPFVA PVLGWDAVSE EEIRWHPAGA GQDVLQNGHV EIWGSSGMGK
TQFVMTLLAQ LSRHSGTHFG IADFKNDYSD ANGFPEFADA EFLDLWEEGA PYNPLALTND
SERSIETAVI ELRDTIEEAT RSFTRMGVRQ KAKLNKALAA AYATGRSEGR WPTLRTLDEQ
LDDDLAGVMG DLTRHRLFKE GPPLGDVIDR NVVFGLSKIP GNGQTTILAA GFILSALLLR
IQNLPPVPNT IRYVGVIDEA HRVADFKAVQ TMIREGRSKG LAVVLATQQP LDLQEVTGAN
AQTRICFGLP DATYATMAAR KMQPDNHRLA EQIRTLGVGE AYLSLRGSAP RLVRMVQAYR
DAERLGLPPL RHI
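
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can serve as an initiation codon that is read as methionine. The sketch below illustrates this on the first 60 bases of the gene sequence. It assumes that the elongation codon assignments of table 11 match the standard genetic code and that the first codon is a valid initiator; `translate` and `start_as_met` are hypothetical names chosen for illustration.

```python
# Codon assignments in TCAG order (standard code; table 11 is assumed
# to share these and to differ only in which codons may initiate).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, start_as_met: bool = True) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    if start_as_met and protein:
        # Assumption: the first codon is an initiator and is read as Met.
        protein[0] = "M"
    return "".join(protein)


# First line of the gene sequence in this record.
first_line_dna = (
    "GTGAGTTTGC GCTTAGGTGC CGCCCTCGCG GAGTACTTGC GGCTTAATCT GAAGCCCGCC"
)
print(translate(first_line_dna))
```

With `start_as_met=False` the first residue would be V rather than M, which is why a plain codon lookup alone does not reproduce the listed protein sequence.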