Gene Cag_1658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1658 
Symbol 
ID3747676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2156441 
End bp2157886 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content48% 
IMG OID637774196 
Productglycine dehydrogenase subunit 2 
Protein accessionYP_379953 
Protein GI78189615 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAC CCCTCATTTT TGACCTCTCT CGCCCCGGGC GTAAGGGATA CAGCTTGTCG 
CCATGCGACG TGCCTGAAGT TCCACTTGAA TCCATTATTC CAGCATCGTT GCTTCGTAAG
GAGGCGGTGG AGTTGCCCGA AGTGGCAGAA AATGAGGTGG TGCGCCACTT TGTGCGCCTT
TCAAACCTCA ACTATCATGT TGATAAAAAT ATGTACCCGT TGGGCAGTTG TACCATGAAG
TACAATCCCA AAGTGAATGA TTACACTTGT GATTTGTCGG GCTTTAGCGC GCTCCATCCA
TTGCAGCCCA CCAGCACAAC GCAAGGTGCT TTGCAGTTGA TGTATGAGTT ATCCAACATG
TTAGCTGAAA TTGCTGGCAT GGCTGGCGTG AGTTTGCAAC CAGCCGCAGG TGCACATGGT
GAGTTAACGG GCATTTTGCT GATTAAAAAA TATCATGAAG TGCGTGGCGA TAAGCGCCAT
AAGCTCTTGG TGGTAGATTC AGCGCATGGC ACGAACCCCG CTTCTGCCGC ACTTGCGGGC
TACGAAACCA TCTCCGTTAA AAGCAATGGC GATGGACGTA CTGACCTTGA GGATTTACGT
AGCAAGTTAG ATGGCGATGT TGCAGCGCTT ATGCTTACCA ATCCCAATAC GATTGGATTG
TTTGAAAAAG AGATTGTGCA AATTGCCGAA ATGGTACACG CCAATGGTAG CTTACTTTAT
ATGGATGGCG CCAATATGAA TGCGCTGCTT GGTATTACTC GCCCTGGTGA TATGGGTTTT
GATGTTATGC ACTACAATCT CCATAAAACC TTTGCAGCTC CGCACGGCGG CGGTGGTCCA
GGTAGCGGTC CCGTTGGTGT GAATGAAAAA CTACTGCCAT ACCTTCCTGC TCCGCTTGTT
GTTAAAGAGG GCGACACTTA CCGCTTAACA TCGGGTGGCG ATGACTCCAT TGGGCGTATG
ATGAACTTTT ATGGCAACTT TGCTGTCTTG GTGCGTGCCT ACACTTACAT TCGGATGTTG
GGAGCTGAAG GGCTGCGCCG AGTTTCGGAA AACGCCATTA TTAACGCCAA CTACCTTTTG
AGCAAATTGC TTGAGCGCTA CGAGCTGCCT TATCCAAAAC CTGTGATGCA CGAATTTTGC
TTGTCGGGTG ATAAGCAGAA AAAAGCGCAT GGCGTTAAAA CGCTTGATAT TGCAAAGCGT
TTGCTTGATT ATGGGTTCCA TGCTCCAACC ATTTACTTCC CGCTTATTGT AAGCGAAGCC
TTAATGATTG AGCCAACTGA AACCGAGTCG AAAGAAACGT TAGATATTTT TGCTGATGCG
TTGCTTGCTA TTGCGCGTGA AGCTGAAGAA AATCCTGATG TGGTGAAAAT GGCGCCATCA
ACAACCGCCG TTAAGCGCCT TGACGAAGCC ACTGCTTCTC GCCAATTGAC TATTTGCTGC
ATGTAA
 
Protein sequence
MKEPLIFDLS RPGRKGYSLS PCDVPEVPLE SIIPASLLRK EAVELPEVAE NEVVRHFVRL 
SNLNYHVDKN MYPLGSCTMK YNPKVNDYTC DLSGFSALHP LQPTSTTQGA LQLMYELSNM
LAEIAGMAGV SLQPAAGAHG ELTGILLIKK YHEVRGDKRH KLLVVDSAHG TNPASAALAG
YETISVKSNG DGRTDLEDLR SKLDGDVAAL MLTNPNTIGL FEKEIVQIAE MVHANGSLLY
MDGANMNALL GITRPGDMGF DVMHYNLHKT FAAPHGGGGP GSGPVGVNEK LLPYLPAPLV
VKEGDTYRLT SGGDDSIGRM MNFYGNFAVL VRAYTYIRML GAEGLRRVSE NAIINANYLL
SKLLERYELP YPKPVMHEFC LSGDKQKKAH GVKTLDIAKR LLDYGFHAPT IYFPLIVSEA
LMIEPTETES KETLDIFADA LLAIAREAEE NPDVVKMAPS TTAVKRLDEA TASRQLTICC
M