Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1811 |
Symbol | |
ID | 4027368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2058312 |
End bp | 2061212 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967000 |
Product | glycine dehydrogenase |
Protein accession | YP_573862 |
Protein GI | 92113934 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAACG ACAAGCGCAG CCTGGCCGAA CTGGCCAACC ACGATGATTT CCTGACGCGT CACAACGGCC CCGACGAGGG CGATGTGCAG CAGATGCTGG AGACGCTGGG ACTTCCCGAT ATCGAATCGC TGATGACCAG CACGATCCCC GGCGACATTC GTCTGGCACG TGAGTTGGCC CTGGAAGAGC CGCGCGGCGA AGCCGAAGCC CTGGAGTATC TCAAGCGTCT GGCACGTCAG AACAAGACCT TCAAGAATTA TATCGGGCAA GGCTACTATC CCACCTACAT GCCAGCAGTG ATCCAGCGCA ACGTGCTGGA GAATCCCGGC TGGTACACGG CCTATACGCC CTATCAGCCC GAAATCGCGC AGGGCCGGCT GGAAGGGTTG CTCAACTTCC AGCAGGTCGT CATGGACTTG ACCGGCATGC CGATCGCCAA CGCTTCGCTG CTCGACGAGG CGACAGCCGC CGCCGAGGCC ATGGCGCTGT GCCGGCGAGC CAACAAGAAG GTCAAGTCGG CGTGCTTTTT CGTGGCCGAC GATGTCTTGC CTCAAACCCT CGATGTATTG CGTACCCGGG CTGCCTATTT CGGCTTCGAG CTCGTCGTCG CGCCGGCGGA AACCCTTCCC GAGCACGAGG TGTTTGGCGC GCTGCTGCAG TATCCCGGCG CCACCGGCGA GGTTCGCGAC CTGGCACCGC TGCTCGAACA GGCCAAGGAG CGACGCGTGA TGACCTGCGT CGCGGCCGAC CTGATGAGCC TGGTCCTGCT GAAAGAGCCC GGCGCGCTCG GTGCCGACAT CGTCGTGGGG AGTTCCCAGC GTTTTGGTGT CCCCATGGGC TACGGCGGGC CGCACGCCGC CTTCTTCGCC GCCAGCGACG CACTCAAGCG CTCGCTGCCC GGACGCATCA TCGGGGTCTC CAAGGACGCT CGCGGTCAAC GCGCCCTGCG CATGGCCATG CAAACCCGCG AGCAGCACAT CCGCCGGGAA AAGGCGACGT CCAACATTTG TACCGCCCAG GCGCTACTGG CCAACATCGC CGGTTTTTAC GCGGTGTACC ACGGCGCCGA TGGCTTGCGC ACCATTGCCA GCCGCATTCA TCGCCTGACC ACCCTGCTGG CCGAGGGCCT CAAGCAGCAC GGCGTCACGC TCGCTCACGA CAGCTGGTTC GACACGCTGA CCCTCAAGGG GCTTGACCAT GGCCAGGTGC ATGGCCGCAG CATGGCGCAT GAGATCAACC TGCGCTACGA CGCCGCCGGC ACGATCGGGG TCAGCCTCGA CGAGACCACC ACGGCCGCCG ACGTGGTCAC GCTGTTCGAC GTCCTGCTGG GCGACGAGCA CGACCTGTCG GTGAGCGAAC TCGACCGCCA CGTCCGCGAG ACAGGCACGA CCGGCATTCC GGCGCATCTC GATCGCGAGA GCGATTTTCT CACCCACCCC ACCTTCCATC GCTATCGCAG CGAAACCGCG ATGCTGCGTT ATCTGAAGCG CCTCGAGAAC AAGGACCTGT CCCTCACCCA TGCCATGATC CCGCTGGGGT CATGCACCAT GAAACTCAAC GCCACCAGCG AAATGGTACC GATCAGCTGG CCGGAGTTCG CCAACATCCA TCCCTTCGCG CCCCACGACC AGGTGGCCGG CTACAAGCAG ATGATCGACG AGCTGTCGGC CTTTCTGGTC GAAATCACCG GCTACGACAG CATTTCCATG CAGCCCAACT CCGGAGCGCA GGGCGAATAC GCGGGCCTGG TCGCCATTCG GCGCTACCAG AAGGCCCAGG GCCAGGGCCA TCGCGACATC TGCCTGATTC CCAGTTCGGC GCATGGGACC AACCCCGCCT CGGCGGCCAT GGCGCAAATG AAGGTGGTGG TCGTCGACTG TGACGACGAA GGCAATATCG ATCTCGAGGA TTTGCGTGGC AAGGCCGAGA AGCACAGCGA ATCGCTCTCG GCGATCATGC TGACCTACCC CTCGACGCAT GGCGTATTCG AGGAAAGCGT GCGCGAGGCC TGCCGGATCG TGCACGATCA CGGCGGCCAG GTGTACATCG ACGGCGCGAA CATGAACGCT CAGGTCGGCC TTTGCCGCCC CGGCGATTTC GGTGGCGACG TCTCGCATCT CAATCTGCAC AAGACCTTCT GTATCCCGCA TGGCGGCGGC GGCCCCGGCA TGGGACCGAT TGGCGTCAAG GCGCATCTGG CGCCCTTCGT GCCCAACCAC GTGGTCACGC CGCTTCCGGG CGTCGACGAA AAGGCCGGCG CGGTCGCCGC GACGGCCTAC GGTTCCGCGT CGATCCTCCC GATCTCCTGG GCCTACATCA AGATGATGGG CGGACGCGGC ATGAAGCGCG CCACCCAGCT GGCGATCCTC AATGCCAACT ACATCGCCAA GCGGCTCGAG GGGCATTACC CGGTGCTCTA CAAGGGCCGC AACGGCACCG TCGCGCATGA ATGCATACTG GACATCCGCC CGCTCAAGGC CGCGTCCGCG ATCAGCGAGG AAGACATCGC CAAGCGCTTG ATGGATTACG GCTTTCATGC CCCGACGATG TCATTCCCGG TCGCCGGCAC CCTGATGGTC GAGCCGACCG AGTCGGAGTC GCGTTACGAG ATCGACCGCT TCTGCGACGC CATGATCGCC ATACGCGAGG AAATCCAGCG TATCGAAACC GGCGAATGGC CCGCGGACAA CAACCCTCTG GTCATGGCAC CGCACACCCA GGCGGACTTG ATGGAAGCCG ACTGGGAACG GCCCTACTCG CGCGAACTCG GCGCCTTCCC CACCGAAGCC ACCAAGGCCG CCAAGTACTG GCCGGCGGTC AACCGCGTCG ACAACGTCTA CGGCGATCGC AACCTCATCT GCACCTGCCC GCCGATCGAC GCCTATCGCG ACGACGTCTG A
|
Protein sequence | MANDKRSLAE LANHDDFLTR HNGPDEGDVQ QMLETLGLPD IESLMTSTIP GDIRLARELA LEEPRGEAEA LEYLKRLARQ NKTFKNYIGQ GYYPTYMPAV IQRNVLENPG WYTAYTPYQP EIAQGRLEGL LNFQQVVMDL TGMPIANASL LDEATAAAEA MALCRRANKK VKSACFFVAD DVLPQTLDVL RTRAAYFGFE LVVAPAETLP EHEVFGALLQ YPGATGEVRD LAPLLEQAKE RRVMTCVAAD LMSLVLLKEP GALGADIVVG SSQRFGVPMG YGGPHAAFFA ASDALKRSLP GRIIGVSKDA RGQRALRMAM QTREQHIRRE KATSNICTAQ ALLANIAGFY AVYHGADGLR TIASRIHRLT TLLAEGLKQH GVTLAHDSWF DTLTLKGLDH GQVHGRSMAH EINLRYDAAG TIGVSLDETT TAADVVTLFD VLLGDEHDLS VSELDRHVRE TGTTGIPAHL DRESDFLTHP TFHRYRSETA MLRYLKRLEN KDLSLTHAMI PLGSCTMKLN ATSEMVPISW PEFANIHPFA PHDQVAGYKQ MIDELSAFLV EITGYDSISM QPNSGAQGEY AGLVAIRRYQ KAQGQGHRDI CLIPSSAHGT NPASAAMAQM KVVVVDCDDE GNIDLEDLRG KAEKHSESLS AIMLTYPSTH GVFEESVREA CRIVHDHGGQ VYIDGANMNA QVGLCRPGDF GGDVSHLNLH KTFCIPHGGG GPGMGPIGVK AHLAPFVPNH VVTPLPGVDE KAGAVAATAY GSASILPISW AYIKMMGGRG MKRATQLAIL NANYIAKRLE GHYPVLYKGR NGTVAHECIL DIRPLKAASA ISEEDIAKRL MDYGFHAPTM SFPVAGTLMV EPTESESRYE IDRFCDAMIA IREEIQRIET GEWPADNNPL VMAPHTQADL MEADWERPYS RELGAFPTEA TKAAKYWPAV NRVDNVYGDR NLICTCPPID AYRDDV
|
| |