Gene Veis_3959 details


Gene Information

Locus tag: Veis_3959
Symbol:
ID: 4691804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4344642
End bp: 4347554
Gene Length: 2913 bp
Protein Length: 970 aa
Translation table: 11
GC content: 68%
IMG OID: 639851708
Product: glycine dehydrogenase
Protein accession: YP_998684
Protein GI: 121610877
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
        [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID: [TIGR00461] glycine dehydrogenase (decarboxylating)
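
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 4344642
END_BP = 4347554
GENE_LENGTH_BP = 2913
PROTEIN_LENGTH_AA = 970


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for the values in this record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the span works out to 2913 bp, that is 971 codons, leaving 970 residues once the stop codon is excluded, which matches the listed gene and protein lengths.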


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0444417
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATGC CATCTGCTGC GCAGCATCCT GTACCGCTCG GTGCACTGGA AAACGCCACC 
GAATTCCTGC CCCGCCATAT CGGCATCGAT GCCGACGACC AGGCCCGGAT GCTGTCGGTG
ATCGGGGAGA CCTCGCGCCG CGCGCTGATC GACAGCATCG TGCCGCGCTC CATCGCGCGC
CGCCAGGCGA TGGAGTTGCC GCTGCCGGTC AGCGAAGCGG CGGCGCTGGC TGAACTGCGG
GCGCTGGCAG CCAGGAACCA GGTTCTCAGG AGCTTCATCG GCCAGGGCTA CTACGGCACG
CACACGCCCG GGGTGATACT GCGCAACATC CTGGAGAACC CGGCCTGGTA CACGGCCTAC
ACGCCCTACC AGGCGGAAAT CTCGCAAGGC CGCATGGAGG CCCTGGTCAA CTTCCAGACC
ATGGTGTGCG ATCTGACCGG CATGCCAATC GCCAACGCCT CGATGCTCGA CGAAGCCACC
GCCGCCGCCG AAGCCATGAT GCTGGCCAAG CGCTCGGTGC CATCGGGCAG CCAGCGCTTC
ATCGTCGCCG GGGACACCCA CCCGCAGACC ATCGAAGTCA TACAGACACG CGCAGGCCCG
CTGGGCATCG AGGTGGTGCG CACCGAGGGC GCCGCCCAGT GGCAGGCGGC ATTGGCCGGC
GACTACTTTG CCGTGCTGGC CCAATACCCG GCCAGCAGCG GCCGCATCGA CGACCTGCGC
GCCGACGTGC AGCAAGTGCA GGCCCGGCAG GCGGCCTTCA TTGTCGCCGC CGACCTGCTG
GCCCTGACCC TGATCACGGC CCCTGGCGAA TGGGGCGCCG ACATCGTGGT CGGCAGCAGC
CAGCGCTTTG GCATGCCCAT GGGCGCCGGC GGCCCGCACG CGGCCTACAT GGCTTGCCGC
GACGAATTCA AACGCTCGCT GCCCGGCCGC CTGGTGGGCG TGAGCCGCGA TGCGCATGGC
CGGCCCGCTT ACCGGCTGGC GCTGCAAACG CGCGAGCAGC ATATCCGCCG CGAAAAAGCC
ACCTCCAACA TCTGCACCGC GCAGGTGCTG CCCGCCGTGA TTGCCAGCAT GTACGCCGTG
TACCACGGCC CGGCGGGCCT CGAGCGCATC GCGCGGCGCG TGGCCTGCTA CACGGCCATC
CTGGCCCGGG GGCTGGCGCA GCTTGGCGTG CCGGTGCGCC CGCAAGCCTG CTTCGACACG
CTGCTGATCG CCACCGGCGA TGTGACCCGA TTCATCGCCG CCAAGGCCGT GAAAATGGGC
GCCAACCTGC GGCTCTACGA TGAAAAATCG CTGTGCATCG CGCTGGACGA AACCACCACC
CGTGGCGATA TCGAACTGCT GTGGAAAGTC TTCTCCAGCG ACGACCAAGC CCAGCCCTGC
TTGGAGACCT TTGAAAACGG CATTGCGCCG CTGATCCCTG CCGGGTTGCA GCGCCGCAGC
CGCTACCTGA CGCACCCGGT GTTCAACACG CACCACAGCG AAACCGGCAT GCTGCGCTAC
ATCCGCCAAC TGTCCGACAA GGACCTGGCG CTCGACCGCA GCATGATCCC GCTGGGCAGT
TGCACGATGA AGCTCAACGC CACCAGCGAG ATGATCCCCA TCACCTGGCC CGGCTTTGCC
GACCTGCACC CGTTTGCGCC CGCCGACCAA TTGCAGGGCT ACCGCGCGCT CGACGCACAA
CTGTGCGCCT GGCTCTGCCA GGCCACCGGC TACGCCGGCA TCAGCCTGCA ACCGAACGCA
GGCTCGCAGG GCGAGTACGC GGGCCTGCTG GCCATACGGG CCTACCACCA GGCCCGGGGC
CAGGGACAGC GCAACATCTG CCTGATCCCG AGCAGCGCGC ATGGCACCAA CCCGGCCAGC
GCCCGGATGG CGGGCCTGCA GGTCGTGGTC AGCGCCTGCG ACGCCAACGG CAATGTCGAT
CTGGCCGACC TCGAAGCCCG GTGCGAACGG CACAGCGCCG AACTGGCGGC CGTGATGATC
ACCTACCCCA GCACGCATGG CGTGTTTGAA ACCGGCGTCA AAGAACTGTG CGCCCTGGTG
CACCGCCATG GCGGCCTGGT GTACGTCGAC GGCGCCAACA TGAACGCGCT GGTGGGCGTG
GCCGCGCCCG GCGAATTCGG TGGCGACGTC AGCCACCTGA ACCTGCACAA GACCTTTTGC
ATTCCGCATG GCGGCGGTGG TCCCGGCGTA GGCCCGGTGT GCGTGGTGCA AGACCTGGTG
CCCTATCTGC CCGGGCATGC CACGACCGGC ACGGCAGGCG GCGTGGGGGC GGTATCTGCG
GCGCCCCTGG GCAACGCGGC GGTGCTGCCC ATCAGTTGGA TGTACTGCCG CATGATGGGC
GCCGAAGGCT TGCAGGCTGC CACCGAGACC GCCATCGTCT CGGCCAACTA CATCAGCGCG
CGCCTGAAAG AGCACTACCC CACGCTGTAT GCCAGCGCCA ACGGCCATGT GGCGCACGAG
TGCATTTTGG ACTTGCGCAG CCTCAAGGAC AGCAGCGGCG TGCTGGCCGA AGACGTGGCC
AAGCGCCTGA TCGACTACGG CTTTCACGCC CCCACGCTGA GCTTTCCGGT GCCCAACACG
CTGATGGTCG AGCCTACCGA GAGCGAAACC CTGTTCGAGC TCGACCGCTT CATCGCCGCG
ATGATCGCCA TCCGCCAGGA AATCCGGCAG ATCGAAATCG GCCTCTGGCC CCGGGACGAC
AACCCGCTCA AGAACGCCCC GCACACGGCC GAAAGCCTGC TCGCCAGCGC ATGGGATCGC
CCCTACACGC GCGCAGTCGC CGCCTACCCG GTGGCGAGCC TGCGCAGCAA CAAATACTGG
CCGCCCGTAG GCCGGGTGGA CAACGTCTGG GGCGACCGCA ACCTGTCATG CAGTTGCATC
CCCGTGGCCG ATGCGGTGTC CGACGTTGCC TGA
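
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be normalized before it can be analyzed. As a minimal sketch, the helpers below (hypothetical names, standard library only) strip the whitespace and compute the GC fraction. The sample string is just the first line of the sequence; pasting the full sequence in its place would give a figure to compare with the listed GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGCCGATGC CATCTGCTGC GCAGCATCCT GTACCGCTCG GTGCACTGGA AAACGCCACC"

cleaned = clean_sequence(sample)
print(len(cleaned))                       # number of bases after removing whitespace
print(cleaned.startswith("ATG"))          # whether the sample begins with an ATG start codon
print(round(gc_fraction(cleaned) * 100))  # GC percentage of the sample only
```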
 
Protein sequence
MPMPSAAQHP VPLGALENAT EFLPRHIGID ADDQARMLSV IGETSRRALI DSIVPRSIAR 
RQAMELPLPV SEAAALAELR ALAARNQVLR SFIGQGYYGT HTPGVILRNI LENPAWYTAY
TPYQAEISQG RMEALVNFQT MVCDLTGMPI ANASMLDEAT AAAEAMMLAK RSVPSGSQRF
IVAGDTHPQT IEVIQTRAGP LGIEVVRTEG AAQWQAALAG DYFAVLAQYP ASSGRIDDLR
ADVQQVQARQ AAFIVAADLL ALTLITAPGE WGADIVVGSS QRFGMPMGAG GPHAAYMACR
DEFKRSLPGR LVGVSRDAHG RPAYRLALQT REQHIRREKA TSNICTAQVL PAVIASMYAV
YHGPAGLERI ARRVACYTAI LARGLAQLGV PVRPQACFDT LLIATGDVTR FIAAKAVKMG
ANLRLYDEKS LCIALDETTT RGDIELLWKV FSSDDQAQPC LETFENGIAP LIPAGLQRRS
RYLTHPVFNT HHSETGMLRY IRQLSDKDLA LDRSMIPLGS CTMKLNATSE MIPITWPGFA
DLHPFAPADQ LQGYRALDAQ LCAWLCQATG YAGISLQPNA GSQGEYAGLL AIRAYHQARG
QGQRNICLIP SSAHGTNPAS ARMAGLQVVV SACDANGNVD LADLEARCER HSAELAAVMI
TYPSTHGVFE TGVKELCALV HRHGGLVYVD GANMNALVGV AAPGEFGGDV SHLNLHKTFC
IPHGGGGPGV GPVCVVQDLV PYLPGHATTG TAGGVGAVSA APLGNAAVLP ISWMYCRMMG
AEGLQAATET AIVSANYISA RLKEHYPTLY ASANGHVAHE CILDLRSLKD SSGVLAEDVA
KRLIDYGFHA PTLSFPVPNT LMVEPTESET LFELDRFIAA MIAIRQEIRQ IEIGLWPRDD
NPLKNAPHTA ESLLASAWDR PYTRAVAAYP VASLRSNKYW PPVGRVDNVW GDRNLSCSCI
PVADAVSDVA