Gene TM1040_2391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2391 
Symbol 
ID4076717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2512654 
End bp2515503 
Gene Length2850 bp 
Protein Length949 aa 
Translation table11 
GC content60% 
IMG OID638007713 
Productglycine dehydrogenase 
Protein accessionYP_614385 
Protein GI99082231 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.572716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTCG AACCGACAGA TTACCTCCCG TATGACTTTG CCAATCGGCG CCACATCGGT 
CCCTCGCCCG AGGAAATGGC CGAGATGCTC AAGGTGGTCG GCGCCGACAG CCTAGATGCG
CTGATCGATG AGACCGTGCC GCAATCCATC CGTCAAAAAG CAGCGCTGGA CTTTGGCCGC
CCGATGTCCG AGCGCGAGCT GCTGTTTCAC ATGCGCGAGG TCGCAGGCAA GAACAAGGTA
ATGACCTCGC TGATCGGCCA AGGCTATCAC GGCACCGTGA CCCCGCCCGC GATTCAGCGC
AACATTCTTG AAAATCCTGC CTGGTACACG GCCTACACGC CCTATCAGCC CGAGATTTCG
CAGGGGCGTC TTGAGGCACT CTTGAACTTC CAGACCATGA TCTCTGATCT GACCGGTCTT
GAAATCGCCA ACGCTTCGCT TCTGGACGAG GCCACCGCCT GCGCCGAAGC CATGACCATG
GCGGAGCGTG TCTCCAAGTC CAAAGCCAAG GCTTTCTTTG TCGACCGCGA CTGTCACCCG
CAGAACATCG CGGTGGTGAA AACCCGCGCC GAACCGCTAG GTATCGAGGT GATCGTTGGC
AACCCCGACA AGATGGATCC CGAGGCTGTC TTTGGTGCAT TGTTCCAGTA TCCGGGCACC
TATGGCCATG TGCGCGACTT TACCGACCAC ATTGCCAAGC TCCACGAACA CAAGGGCATC
GCTGTTGTTT CTGCCGACCC GATGTCGCTC ACGCTGCTGA AAGAGCCGGG TGCGATGGGA
GCAGATATTG CGGTCGGCTC CACCCAGCGT TTTGGCGTGC CTGTTGGGGC AGGGGGGCCG
CACGCGGCCT ATATGGCGAC CAAAGACGCC TACAAGCGCA ACATGCCGGG CCGGATCGTC
GGTGTGTCTG TCGATGCACA TGGCAACAAA GCTTACCGTC TGTCGCTGCA GACCCGGGAG
CAGCACATCC GCCGTGAAAA AGCGACCTCG AACGTCTGCA CTGCGCAAGC GCTTCTTGCG
GTGATGGCGT CCATGTATGC GGTCTTCCAC GGTCCCAAAG GTCTGAAGGC GATCGCGCAA
CGCATCCACC GCAAGGCCGT GCGTCTGGCC AAAGGCCTCG AGGAGGCCGG CTTCAAAGTC
GACCCGCAGG CTTTCTTTGA TACCATTACC GTCGATGTTG GACCGCTGCA GGCAGCCGTC
ATGAAATCGG CGGTGGACGA AGGGATCAAC CTGCGTCGCG TGGGTGAAAC CCGTGTGGGG
ATCTCGGTGG ATGAAACCAC CCGCCCCGAA ACCATCGAAG CGGTTTGGCG TGCCTTTGGC
ATCGTGCGTG CGGATGATGA TTTCACCCCG GACTACCGCG TGCCTGCGAA CATGCATCGC
AAGTCGGACT ACCTGACGCA CCCGATCTTC CACATGAACC GCGCCGAGAC CGAGATGATG
CGCTACATGC GTCGCCTTGC GGATCGCGAT CTGGCGCTGG ACCGTGCGAT GATCCCGCTT
GGCTCCTGCA CCATGAAGCT CAACGCGGCA GCGGAAATGA TGCCGCTCAG CTGGCCAGAG
TTTTCGACCA TCCACCCCTT TGCACCTGCG GATCAGCAGG CTGGCTATGG CGAGATGGTT
GAAGACCTTT CGAAGAAGCT CTGCGACATT ACCGGCTATG ATGCGATCTC CATGCAGCCG
AACTCTGGCG CGCAGGGCGA ATATGCTGGC CTCCTGACCA TTGCCGCCTA TCACAAGGCG
CGCGGTGAAG GGCACCGTAA TATCTGCCTG ATCCCGATGA GCGCCCACGG CACCAACCCG
GCCTCCGCTC AGATGGTTGG CTGGAAAGTG GTTGTGGTCA AATCCGACGA GCGCGGCGAC
ATCGATCTTG AGGACTTCCG TGCCAAGGCC GAGAAACACG CCGACAACCT CGCGGGCTGC
ATGATCACCT ACCCATCGAC CCATGGCGTG TTTGAGGAAA CCGTGCATGA GGTCTGCAAA
ATCACCCATG ATGCGGGTGG TCAGGTCTAT ATCGATGGCG CCAACATGAA CGCCATGGTG
GGGCTGAGCC GTCCGGGCGA TCTGGGGGGG GATGTGAGCC ACCTCAACCT GCACAAAACC
TTTGCCATTC CGCATGGCGG TGGGGGCCCC GGCATGGGAC CGATCGGCGT CAAAGCGCAT
CTGGTCGAAC ACCTCCCGGG GCATCCTGAA ACCGGTGGAT CCGAAGGGCC TGTCTCTGCG
GCGCCCCTGG GGTCGGCGTC GATCCTGACG ATCTCCTGGG CCTATTGCCT GATGATGGGC
GGTGCAGGTC TGACGCAGGC GACGAAGGTT GCGATCCTCT CGGCCAACTA TCTGGCCAAA
CGGCTCGAAG GTGCGTTTGA CGTGCTCTAC AAAGGACCGA CAGGTCGGGT TGCGCATGAG
TGCATTCTCG ATACGCGTCC CTTTGCCGAC AGCGCAGATG TGACCGTGGA CGATGTGGCC
AAGCGTCTGA TGGACAGCGG TTTCCACGCT CCTACCATGA GCTGGCCCGT TGCGGGCACT
CTGATGGTGG AGCCAACCGA GTCCGAAACC AAGGCGGAAC TGGATCGTTT TGTCGATGCG
ATGTTGTCGA TCCGTGACGA GATCAAAGCG GTCGAGTCCG GCGAGATGCC GCGCGAGAAC
AATGCGCTCA AGAACGCTCC GCACACGATG GAAGATCTGG TGAAAGACTG GGATCGCCCC
TACTCGCGCG AGCAGGGCTG CTTCCCGCCG GGTGCTTTCC GTGTCGACAA ATATTGGCCG
CCCGTGAACC GCGTCGACAA CGTCTATGGC GACCGTCACC TGATCTGCAC CTGCCCGCCG
CTTGAGGACT ACGCAGAAGC GGCAGAGTGA
 
Protein sequence
MPFEPTDYLP YDFANRRHIG PSPEEMAEML KVVGADSLDA LIDETVPQSI RQKAALDFGR 
PMSERELLFH MREVAGKNKV MTSLIGQGYH GTVTPPAIQR NILENPAWYT AYTPYQPEIS
QGRLEALLNF QTMISDLTGL EIANASLLDE ATACAEAMTM AERVSKSKAK AFFVDRDCHP
QNIAVVKTRA EPLGIEVIVG NPDKMDPEAV FGALFQYPGT YGHVRDFTDH IAKLHEHKGI
AVVSADPMSL TLLKEPGAMG ADIAVGSTQR FGVPVGAGGP HAAYMATKDA YKRNMPGRIV
GVSVDAHGNK AYRLSLQTRE QHIRREKATS NVCTAQALLA VMASMYAVFH GPKGLKAIAQ
RIHRKAVRLA KGLEEAGFKV DPQAFFDTIT VDVGPLQAAV MKSAVDEGIN LRRVGETRVG
ISVDETTRPE TIEAVWRAFG IVRADDDFTP DYRVPANMHR KSDYLTHPIF HMNRAETEMM
RYMRRLADRD LALDRAMIPL GSCTMKLNAA AEMMPLSWPE FSTIHPFAPA DQQAGYGEMV
EDLSKKLCDI TGYDAISMQP NSGAQGEYAG LLTIAAYHKA RGEGHRNICL IPMSAHGTNP
ASAQMVGWKV VVVKSDERGD IDLEDFRAKA EKHADNLAGC MITYPSTHGV FEETVHEVCK
ITHDAGGQVY IDGANMNAMV GLSRPGDLGG DVSHLNLHKT FAIPHGGGGP GMGPIGVKAH
LVEHLPGHPE TGGSEGPVSA APLGSASILT ISWAYCLMMG GAGLTQATKV AILSANYLAK
RLEGAFDVLY KGPTGRVAHE CILDTRPFAD SADVTVDDVA KRLMDSGFHA PTMSWPVAGT
LMVEPTESET KAELDRFVDA MLSIRDEIKA VESGEMPREN NALKNAPHTM EDLVKDWDRP
YSREQGCFPP GAFRVDKYWP PVNRVDNVYG DRHLICTCPP LEDYAEAAE