Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1658 |
Symbol | |
ID | 3747676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2156441 |
End bp | 2157886 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637774196 |
Product | glycine dehydrogenase subunit 2 |
Protein accession | YP_379953 |
Protein GI | 78189615 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAC CCCTCATTTT TGACCTCTCT CGCCCCGGGC GTAAGGGATA CAGCTTGTCG CCATGCGACG TGCCTGAAGT TCCACTTGAA TCCATTATTC CAGCATCGTT GCTTCGTAAG GAGGCGGTGG AGTTGCCCGA AGTGGCAGAA AATGAGGTGG TGCGCCACTT TGTGCGCCTT TCAAACCTCA ACTATCATGT TGATAAAAAT ATGTACCCGT TGGGCAGTTG TACCATGAAG TACAATCCCA AAGTGAATGA TTACACTTGT GATTTGTCGG GCTTTAGCGC GCTCCATCCA TTGCAGCCCA CCAGCACAAC GCAAGGTGCT TTGCAGTTGA TGTATGAGTT ATCCAACATG TTAGCTGAAA TTGCTGGCAT GGCTGGCGTG AGTTTGCAAC CAGCCGCAGG TGCACATGGT GAGTTAACGG GCATTTTGCT GATTAAAAAA TATCATGAAG TGCGTGGCGA TAAGCGCCAT AAGCTCTTGG TGGTAGATTC AGCGCATGGC ACGAACCCCG CTTCTGCCGC ACTTGCGGGC TACGAAACCA TCTCCGTTAA AAGCAATGGC GATGGACGTA CTGACCTTGA GGATTTACGT AGCAAGTTAG ATGGCGATGT TGCAGCGCTT ATGCTTACCA ATCCCAATAC GATTGGATTG TTTGAAAAAG AGATTGTGCA AATTGCCGAA ATGGTACACG CCAATGGTAG CTTACTTTAT ATGGATGGCG CCAATATGAA TGCGCTGCTT GGTATTACTC GCCCTGGTGA TATGGGTTTT GATGTTATGC ACTACAATCT CCATAAAACC TTTGCAGCTC CGCACGGCGG CGGTGGTCCA GGTAGCGGTC CCGTTGGTGT GAATGAAAAA CTACTGCCAT ACCTTCCTGC TCCGCTTGTT GTTAAAGAGG GCGACACTTA CCGCTTAACA TCGGGTGGCG ATGACTCCAT TGGGCGTATG ATGAACTTTT ATGGCAACTT TGCTGTCTTG GTGCGTGCCT ACACTTACAT TCGGATGTTG GGAGCTGAAG GGCTGCGCCG AGTTTCGGAA AACGCCATTA TTAACGCCAA CTACCTTTTG AGCAAATTGC TTGAGCGCTA CGAGCTGCCT TATCCAAAAC CTGTGATGCA CGAATTTTGC TTGTCGGGTG ATAAGCAGAA AAAAGCGCAT GGCGTTAAAA CGCTTGATAT TGCAAAGCGT TTGCTTGATT ATGGGTTCCA TGCTCCAACC ATTTACTTCC CGCTTATTGT AAGCGAAGCC TTAATGATTG AGCCAACTGA AACCGAGTCG AAAGAAACGT TAGATATTTT TGCTGATGCG TTGCTTGCTA TTGCGCGTGA AGCTGAAGAA AATCCTGATG TGGTGAAAAT GGCGCCATCA ACAACCGCCG TTAAGCGCCT TGACGAAGCC ACTGCTTCTC GCCAATTGAC TATTTGCTGC ATGTAA
|
Protein sequence | MKEPLIFDLS RPGRKGYSLS PCDVPEVPLE SIIPASLLRK EAVELPEVAE NEVVRHFVRL SNLNYHVDKN MYPLGSCTMK YNPKVNDYTC DLSGFSALHP LQPTSTTQGA LQLMYELSNM LAEIAGMAGV SLQPAAGAHG ELTGILLIKK YHEVRGDKRH KLLVVDSAHG TNPASAALAG YETISVKSNG DGRTDLEDLR SKLDGDVAAL MLTNPNTIGL FEKEIVQIAE MVHANGSLLY MDGANMNALL GITRPGDMGF DVMHYNLHKT FAAPHGGGGP GSGPVGVNEK LLPYLPAPLV VKEGDTYRLT SGGDDSIGRM MNFYGNFAVL VRAYTYIRML GAEGLRRVSE NAIINANYLL SKLLERYELP YPKPVMHEFC LSGDKQKKAH GVKTLDIAKR LLDYGFHAPT IYFPLIVSEA LMIEPTETES KETLDIFADA LLAIAREAEE NPDVVKMAPS TTAVKRLDEA TASRQLTICC M
|
| |