Gene Mvan_1009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1009 
Symbol 
ID4645794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1055640 
End bp1058048 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content71% 
IMG OID639804510 
Productputative outer membrane adhesin like proteiin 
Protein accessionYP_951853 
Protein GI120402024 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.227831 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGCAC AGCAACGCGT CGGGTGGGTC GCCGGTGTGG CGGCGTCGGC GGGCATCGGA 
GCCGCGGTGC TGCTCGGGGG GACCGGTACA GCGACCGCAG ATGCGCCCTC GTCGGCGTCG
GACACCGGGG ATCCCAGCAG CGGGGCCGCG GTACGCGGCG ACACCGACCA CGACGCGGTC
ACCGCGTCGG CGGAGTCGGT CGACGCTGAC GAGGACCCGC GCGATGACGA AGACCCGCGC
GATGACGTTG CCGGGCAGAC GGTCTCGTCA GCGTCGTCGG GTGAGAAAAC CGACGAGATC
GATGACGCCG TCGAGGTCGC CGACGCCGAC GAGGAGGCCG CCGCCGCGGG GGCAACCGAG
AGCGAGGGCA CCGACAGCGG CGACCCCGCG CCGGCGCAAA CCCAGGCCGT CGCGTCGCTG
ATGTCGTCCG CGCGCCGCGG ATCCGCGGAG CCCGGCGAGG CCGCCGCTGA GGTGCCGAAC
TCTGCGCCCA GCGTGACAAC GTCGGTGGGC GTGCCCGACC CGCTCAGCGG CGTCACGAAT
ATCGCGGTGA CCGGCCTTGA TCCGGACGGC GACCAGCTGC GCTACACCGC GGCCAGACCC
ACGTTCGGAC GCGTCACCGG CGACGGCACC GGCGCGTTCA CCTACACACC GACGTCCTTC
TCTCGGCTGC TGGCGCGGTT CGTGCCGTTC GTCCGGTCCG ACCGCTTCGC GGTGACGGTC
AGCGACGGAC GCGGCGGCGT CACCTCGTCG ACGGTGAACC TGACGATTGT GCCGCTCAAC
AGCGCACCGA GGCCCCGCGC CTCCACGGTC AACGCACCGG TGCCGGCGAC CGGCGTGGTG
ACCGGGAAAG CCAATGCCAC CGACCCGAAC TGGGACCGTC TCACGTTCGT GGCGTCGACC
GTCGACACCG CCAAGGGCCG TGTCACCGTG AACGGCAACG GCACGTTCAT TTACACCCCC
ACGGCGGCGG CCCGCCACGG CGCCGCGGCC GCCGCCGCGT CGACGGCCGA CCGCACCGAC
ACCTTCCGCG TCGCGGTCGC CGACGCCTAC GGCGCAGTCA CCGAAATCCC GGTCACGGTG
ACGATCAGCC CGGCCAACGT GGCGCCGACC GCCACTGCCG GTGTGCAGAA CCCCGATGCG
GCAACCGGAT TGGTCGCCGG GACGGTGATC GGGACCGACG CCGACGGCGA CGGCCTCACC
TACAGCGGCC CGACCGCCAC CGCCAAGGGG ACGGTTGTGG TGGCCGCCGA CGGCGCCATC
ACGTACCGGC CCAACGAGAT TGCCCGGCGG ATCGCCGCCT CGGTATACTC CACACCCGCG
TCGCGATCCG ATACGTTCGG TGTCACGGTG ACCGACGGGC ACGGCGGCAG CGCCACCGTC
ACCGTCACCG TCGCGATCGC TCCCGACACC CAGTCGGCCG CCGCCGTGCC GCCGTCCACG
TTCTGCGGGT GCACGCTGAT GCCGACGGAC ACCATCTTCC ATGCCGACAT CCGCGGCCTG
CCAACACTTT CCGAATCGAA CTCGTGGATC GAGCTCCTCG GCGGCGGGCG CGGCGCCACG
CTGGCGGCCC GCTGGGGCGG CTCCGAGTGG ATGGGAAGCA CCGCCGGTAT CCCGGTCAAC
GTCGTCGGAG CCGACCACCC CACGGAGGAC GTCGTGTTCA ACCGCGGCTA CTCGACCACA
GGGCCCGGCA TCGACGACCG GCCGTATGCG ATTCCCGATC GCCCGCTGGT GGAGGGCATG
CCGTCGTATC CCGCCTGGGA CCGGCACCTG TTCGTGTTCC AGGAAGGCAC CTGCATCTCG
CAGGAGTTGA TCAACGTCGC CAACGGCGTC GAACTGCCCG GCGCGGGGAT CCTGGACATC
CTCGGCAACG CCGTCTACCG GTCGATCTGG GGCTCTTCGT GGATTGCGCA GGGCGGGGTG
CACTACGACA TGAACTCGGG CCTGTATCCC GCCATCGGAT ACGCCAACGC CTCGCAGCTT
CCCCAGGTGC CGATGATGTT GCGCCCCGAC GAGATCGAGC GGGGATACGT CGACCACATG
CTGGGCATGA CCATCGCCAA AGACCTCGGT GCGGGCTACG TCTGGCCGGC GCGCGCGGGG
GACGGGAGCG GTGCCGATGG TGTTCCGATG GGGATGGTTT TCCGGCTGCG CGAGGACGTC
GACCTCAGCG GGTACGCCGA ATCGACACAG GCGGTGCTGC GCGCGCTCCA GGTGCACGGG
GCCGTCATCT ACGACTCGAG TGCCCCCGGC GGTGACGGCC TGAATCTCGC GGGCATGAGC
AACGGGTGGG AGGGCTCCGA CCACCTGGTG ATGCAGCGCG AGCTGAGCAC GATCCCGGTG
CAGTGGTTCG AAGCTGTCGA CGTGGTGGGC CTCGCCGCCG ACCCCGCCGT CGGCTGGCAG
GTCAAATGA
 
Protein sequence
MSAQQRVGWV AGVAASAGIG AAVLLGGTGT ATADAPSSAS DTGDPSSGAA VRGDTDHDAV 
TASAESVDAD EDPRDDEDPR DDVAGQTVSS ASSGEKTDEI DDAVEVADAD EEAAAAGATE
SEGTDSGDPA PAQTQAVASL MSSARRGSAE PGEAAAEVPN SAPSVTTSVG VPDPLSGVTN
IAVTGLDPDG DQLRYTAARP TFGRVTGDGT GAFTYTPTSF SRLLARFVPF VRSDRFAVTV
SDGRGGVTSS TVNLTIVPLN SAPRPRASTV NAPVPATGVV TGKANATDPN WDRLTFVAST
VDTAKGRVTV NGNGTFIYTP TAAARHGAAA AAASTADRTD TFRVAVADAY GAVTEIPVTV
TISPANVAPT ATAGVQNPDA ATGLVAGTVI GTDADGDGLT YSGPTATAKG TVVVAADGAI
TYRPNEIARR IAASVYSTPA SRSDTFGVTV TDGHGGSATV TVTVAIAPDT QSAAAVPPST
FCGCTLMPTD TIFHADIRGL PTLSESNSWI ELLGGGRGAT LAARWGGSEW MGSTAGIPVN
VVGADHPTED VVFNRGYSTT GPGIDDRPYA IPDRPLVEGM PSYPAWDRHL FVFQEGTCIS
QELINVANGV ELPGAGILDI LGNAVYRSIW GSSWIAQGGV HYDMNSGLYP AIGYANASQL
PQVPMMLRPD EIERGYVDHM LGMTIAKDLG AGYVWPARAG DGSGADGVPM GMVFRLREDV
DLSGYAESTQ AVLRALQVHG AVIYDSSAPG GDGLNLAGMS NGWEGSDHLV MQRELSTIPV
QWFEAVDVVG LAADPAVGWQ VK