Gene Mvan_3216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3216 
Symbol 
ID4647616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3425355 
End bp3427244 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content67% 
IMG OID639806692 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_954023 
Protein GI120404194 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTACT CCAATCACAT CGGACGCGTC GGGGCACTTG CGGTGACCCT GGGTGTGGGT 
TGGGCGGTGG CGTCGGCACC GGGCGTCGCG CACGCCGAGC CGTCCGCCCC GTCGGCATCC
AGCGAGTCCT CGTCGACGAC GACGACCTCG AATGAGGAGT CGTCGGCGAA GGTCGGGACC
ACCGCTGACG ACGGTGCGGC GGCTCCCGAC GACGAAGCCG AGAAAGAGGT CGCCGAGGCC
GACGACGAGG AAGTCGTCGA CGAAGACGAA GACGAGGACG AGGAGGCCCA GGACGTCGAG
GCCCAGGACG TCGAGGCTCA GGACGCAGTC GACGTCGAGG ACACTGACAG CGTCGACGTC
GACGATGCCG AGTACGCCGA AGCGGCAGTG GTCACGCTGG TCTCCGATCA GGTCACGTCC
GAGCCCGCAC CGCAACCCGA GCCGACCCCC GCCGCACCGA TGGGGGGATC GGCGATGCTG
GCGTCGCTGG CCGCCGTGCG GGACGAGTTG GAGCGCAACG TGATTCGCCG CAACACAGTG
GCGACTCAGG TCACCGCACT GGCCGACGAA ACCCCCAACG TGCTGCTCAT CGGGGTGGAC
GGCACCAACC TCAGCCGTGT GCTGGCCAAC CCCGCCACCA CCACGAACTT CTTCAGCCTG
ATCCAGAACG GCACCACCGC CGCCTCGACC ATCGTCGGGC ACACCACCAT CTCGAACCCG
TCGTGGTCGT CGATCCTGAC CGGCGCGTGG GGGGAGAAGA CGGGCGTCAT CAACAACATC
TTCACCCCGT GGACCTACGA AAAGTGGCCG ACGGTGTTCA CCCAGCTCGA GACGCTCGAC
GGCGACATCG TGACCACGTC GATCGCCAAC TGGAACGTCA TCTCGGGCAT CGCCGACTCC
GGGCTCGGCG CCGACACCGT CGTCAACGTG TCCCAGGTGG AAGGGGACAC GAACTGGTTG
CTGACCGACG ACGAGGTCGG TGACCTGACC GAGGCCGCCA TCGCCGCCGC CAGCGCAGAT
GCCGCCAACT TTATGTTCAG CTACTTCGTC GGCGTGGACG AGAACGGGCA CCTGTACGGC
GGTGACTCGC CGGAATACGC CGCGGCCGTG GCGAACTTCG ACCGCAATCT CGGCGAGATC
CTGCAGGCCG TCAGCACCTG GGAGGCTGCG ACGGGGGAGA AGTGGACGAT CATCATGGTG
ACGGACCACG GCCACCAGGC GCAGAAGGGG CTCGGTCACG GCTTCCAATC ACCGGACGAG
ACATCGACAT TCGTCATCGC GAGCAACCCC GAACTGTTCG GCCAGGGCGT GATCAACCTG
AAGTACTCGA TCGTCGACGT GACGCCGACG GTGCTGTCGC TGTTCGGTTT CGAGCCCGCC
GAAGACTCCG ACGGAGTGCC GCTGACCGAC CTCGACGACG CCGACGTCAC GCCCGTCGAC
AATGACGCCG CGCTGCGCGG GGCGCTGCTC GACATCATCG GCAAGTACGG GTACCCGGAC
ATCGGGACCA CCCTCGCGCT CGGCGCCCGC ACCATCTTCG CCTCGGTGCC GTACTACGTC
GACATGCTCA CCACCGGCAT CACCGACAGC CTGCAGACGA TCGCCGACGC GGGCATCTTC
CTGATCAGCC CGCTGGCGCA GCTGGCGATC GTGCCGGTCA AGTTCGTGGG CGATCTCGCC
TATGTCGCGA CGAACTTCGT CGCACAGATC GTCGCCCGTC TCACCGGAGT GACCGGGGCC
AGCATCTTCC CGCTGTGGCC GCCGGCGCCG CCGACCTTCC CCGAATCTCC GGAGGAGCTC
TCGACACCGG ATCTGGTGGC GCTGGTGTGC AGCGACGGGC GGGTGTCGAG CGCGGTGTTC
GCGTGCGGGG CTGCCGCGGT CGCGGTCTGA
 
Protein sequence
MGYSNHIGRV GALAVTLGVG WAVASAPGVA HAEPSAPSAS SESSSTTTTS NEESSAKVGT 
TADDGAAAPD DEAEKEVAEA DDEEVVDEDE DEDEEAQDVE AQDVEAQDAV DVEDTDSVDV
DDAEYAEAAV VTLVSDQVTS EPAPQPEPTP AAPMGGSAML ASLAAVRDEL ERNVIRRNTV
ATQVTALADE TPNVLLIGVD GTNLSRVLAN PATTTNFFSL IQNGTTAAST IVGHTTISNP
SWSSILTGAW GEKTGVINNI FTPWTYEKWP TVFTQLETLD GDIVTTSIAN WNVISGIADS
GLGADTVVNV SQVEGDTNWL LTDDEVGDLT EAAIAAASAD AANFMFSYFV GVDENGHLYG
GDSPEYAAAV ANFDRNLGEI LQAVSTWEAA TGEKWTIIMV TDHGHQAQKG LGHGFQSPDE
TSTFVIASNP ELFGQGVINL KYSIVDVTPT VLSLFGFEPA EDSDGVPLTD LDDADVTPVD
NDAALRGALL DIIGKYGYPD IGTTLALGAR TIFASVPYYV DMLTTGITDS LQTIADAGIF
LISPLAQLAI VPVKFVGDLA YVATNFVAQI VARLTGVTGA SIFPLWPPAP PTFPESPEEL
STPDLVALVC SDGRVSSAVF ACGAAAVAV