Gene Mvan_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3131 
Symbol 
ID4646161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3327511 
End bp3330363 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content70% 
IMG OID639806608 
Productglycine dehydrogenase 
Protein accessionYP_953939 
Protein GI120404110 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0707314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.211977 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGATC AGTTCCGCAC CCCCTCCTTC GTCGACCGGC ACATCGGGCC CGACGCGCAC 
GCCGTCGCGA CCCTGCTCGG CACCATCGGC GTCAGCTCCC TTGACGAACT CGCGGCCAAG
GCGCTGCCCG CCGGGATCCT CGATCCGCTG ACCGGGGCCG GCACCGCGCC GGGGCTCGAG
CATCTGCCGC CGGCCGCCAC CGAACACGAG GCGCTGGCCG AACTGCGTGC GCTGGCCGAG
TCGAACACCG TCGCGGTGTC GATGATCGGG CAGGGGTACT ACGACACGCT CACCCCGCCG
GTGTTGCGTC GCAACATCCT TGAGAACCCG GCGTGGTACA CCGCCTACAC GCCGTACCAG
CCCGAGATCA GCCAGGGCCG GCTCGAGGCG CTGCTCAACT TTCAGACGAT GGTCAGCGAC
CTGACCGGCC TCGAGGTCGC CAACGCGTCG ATGCTCGACG AGGGGACCGC TGCCGCCGAG
GCGATGACGC TGATGCACCG CGCCGTGCGC GGACCGTCCA ACCGTCTGGT GGTCGACAGC
GACGTCTACG CGCAGACGGC CGCGGTGCTT GCCACCCGCG CCGAACCGCT CGGCATCGAG
ATCGTGACCG CCGATCTGCG CCACGGCCTG CCCGACGGTG ACTTCTTCGG CGTGATCGTG
CAGCTGCCGG GCGCCGGCGG CGCGATCACC GACTGGTCCG AACTGGTCAC CCAGGCCCAC
GACCGCGGCG CCCTGATCGC TGTCGGTGCG GACCTGCTCG CGCTGACACT GATCGCCCCG
CCGGGTGAGA TCGGCGCCGA CGTCGCGTTC GGAACGACCC AGCGCTTCGG GGTGCCGATG
GGATTCGGCG GACCGCACGC GGGATACCTT GCGGTGCATT CCAAGCACGC CCGTCAATTG
CCCGGCCGGC TGGTCGGGGT GTCGGTGGAT GCCGACGGTG CCGCAGCGTT CCGATTGGCG
TTGCAGACCC GCGAACAGCA CATCCGGCGC GACAAGGCCA CCAGTAACAT CTGCACGGCG
CAGGTGCTGC TGGCCGTGAT GGCGGCGATG TACGCCAGCT ATCACGGCGC GGACGGGCTG
CGGGCGATCG CCCTGCGCGT GCATGCGCAG GCGTCGGCGC TGGCCGCCGG CCTGTCCGGC
GCGGGCATCG AGGTGGTGCA CCCGTCGTTC TTCGACACCG TGCTGGCCCG GGTGCCGGGA
CGCGCGAGCG AGGTGCGCGC CGCCGCCAAG GAGCGCGGCA TCAACGTCTG GCTCGTCGAC
GACGACCACA TCTCGGTGTC GTGCGACGAG GCCACCACGG ACGGCCACGT CGCCGACGTG
CTCGCCGCGT TCTCGGCCGA GCCGACGGTC GGCGGGGACG TCACCGCCGC CTCGATCGTC
ACCAGGATCT CGGAGTTCCT CACCCACCCG GCGTTCACCA GGTACCGCAC CGAAACCGAG
ATGATGCGCT ATCTGCGTGC GCTGGCGGAC AAGGACATCG CGTTGGACCG CAGCATGATT
CCGCTGGGCT CGTGCACCAT GAAGCTCAAC GCGGCCGCGG AGATGGAGCC GATCACCTGG
CCGGAGTTCG CCCGTCAGCA TCCGTTCGCT CCCGCGTCGG ACACCCCCGG AATCCGCCGC
CTGATCGCCG ATCTGCAGGG GTGGCTCACC GGCATCACGG GCTATGACGA GATCTCGCTG
CAGCCCAATG CCGGCTCGCA GGGGGAGTAC GCCGGGCTGC TGGCGATCCG GGCCTACCAC
GCGGCACGCG GGGACACCGA CCGCACCGTG TGCCTGATCC CGGCGAGCGC ACACGGCACC
AACGCCGCTT CCGCGGCGAT GGTCGGCATG CAGGTCGTGG TCGTCGCGTG CCGCGCCAAC
GGCGACGTCG ACCTCGACGA TCTGCGTGCC AAGGTCACCG AGCACGCCGA ACGACTGGCC
GCGTTGATGA TCACCTATCC GTCGACGCAC GGCGTCTACG AGCATGACAT CGCAGAGATC
TGCGCCGCGG TGCACGACGC GGGCGGTCAG GTCTACGTCG ACGGTGCCAA CCTGAACGCG
CTGGTGGGGC TGGCCCGTCC CGGCAGGTTC GGCGGCGACG TCAGCCACCT GAACCTGCAC
AAGACGTTCT GCATCCCGCA CGGCGGCGGT GGCCCGGGTG TGGGTCCGGT CGCGGTGCGC
TCGCACCTGG CGCCCTATCT GCCCGGGCAT CCGCTGGCCG ACGAGCTCTC CGATGAACAC
ACGGTGTCGG CCGCCCCGTA CGGTTCGGCG TCGATCCTGC CGATCACGTG GGCCTACATC
CGGATGATGG GCGCGCAGGG ACTGCGGTCG GCCTCGCTGG TGGCGATCGC GTCGGCCAAC
TACATCGCCC GCCGCCTCGA CGAGTACTAC CCGGTGCTCT ACACCGGGGA GAACGGCATG
GTGGCGCACG AGTGCATCCT CGATCTGCGT GCGATCACGA AGGCCACCGG AGTGACGGTG
GACGACGTCG CCAAGCGGCT CGCCGATTAC GGCTTCCACG CACCGACGAT GAGTTTCCCC
GTCGCGGGCA CGTTGATGGT CGAGCCCACC GAGAGCGAGT CGCTGGCCGA GGTCGACGCG
TTCTGTGAGG CGATGATCGC GATCCGCGGC GAGATCGACA AGGTCGGTTC CGGGATGTGG
TCCGTTGACG ACAACCCGCT GCGCGGAGCA CCGCACACCG CCGAGAGTCT GCTGGTCGAG
GACTGGCACC ACCCGTACAC ACGCGAGCAG GCCGCCTACC CGCTGGGCAA GGGCTTCCGG
CCCAAGGTGT GGCCGCCCGT GCGGCGCATC GACGGCGCCT ACGGCGACCG CAACCTGGTC
TGCTCGTGCC CGCCGGTAGA GGCGTTCGCG TGA
 
Protein sequence
MSDQFRTPSF VDRHIGPDAH AVATLLGTIG VSSLDELAAK ALPAGILDPL TGAGTAPGLE 
HLPPAATEHE ALAELRALAE SNTVAVSMIG QGYYDTLTPP VLRRNILENP AWYTAYTPYQ
PEISQGRLEA LLNFQTMVSD LTGLEVANAS MLDEGTAAAE AMTLMHRAVR GPSNRLVVDS
DVYAQTAAVL ATRAEPLGIE IVTADLRHGL PDGDFFGVIV QLPGAGGAIT DWSELVTQAH
DRGALIAVGA DLLALTLIAP PGEIGADVAF GTTQRFGVPM GFGGPHAGYL AVHSKHARQL
PGRLVGVSVD ADGAAAFRLA LQTREQHIRR DKATSNICTA QVLLAVMAAM YASYHGADGL
RAIALRVHAQ ASALAAGLSG AGIEVVHPSF FDTVLARVPG RASEVRAAAK ERGINVWLVD
DDHISVSCDE ATTDGHVADV LAAFSAEPTV GGDVTAASIV TRISEFLTHP AFTRYRTETE
MMRYLRALAD KDIALDRSMI PLGSCTMKLN AAAEMEPITW PEFARQHPFA PASDTPGIRR
LIADLQGWLT GITGYDEISL QPNAGSQGEY AGLLAIRAYH AARGDTDRTV CLIPASAHGT
NAASAAMVGM QVVVVACRAN GDVDLDDLRA KVTEHAERLA ALMITYPSTH GVYEHDIAEI
CAAVHDAGGQ VYVDGANLNA LVGLARPGRF GGDVSHLNLH KTFCIPHGGG GPGVGPVAVR
SHLAPYLPGH PLADELSDEH TVSAAPYGSA SILPITWAYI RMMGAQGLRS ASLVAIASAN
YIARRLDEYY PVLYTGENGM VAHECILDLR AITKATGVTV DDVAKRLADY GFHAPTMSFP
VAGTLMVEPT ESESLAEVDA FCEAMIAIRG EIDKVGSGMW SVDDNPLRGA PHTAESLLVE
DWHHPYTREQ AAYPLGKGFR PKVWPPVRRI DGAYGDRNLV CSCPPVEAFA