Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1634 |
Symbol | |
ID | 3747942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2132227 |
End bp | 2133600 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637774173 |
Product | CBS |
Protein accession | YP_379930 |
Protein GI | 78189592 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG0031] Cysteine synthase [COG3620] Predicted transcriptional regulator with C-terminal CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAGT ACCCAACCCT TACGCTTACT ACCGAAACGC CTTTACTTCA GCTTCGTCAT CTTGCGCGTA CTATACGTCC AACTATTATG GCGAAGCTTG AATACATGAA TTCGGCATAC TCGCACCATT TTCGAGCAGC ACAAGCTATT GTGCAAGCGG CTGAAGAGCA AGAGCAAATT CATCCGGGTA TGACGTTGGT TGATTGGAGC CTTGGCAGTA GCGCGATTGC ACTTGCTATG GTTGCCGTTA ATCGTGGCTA CAAATTGCTG TTAGCGGTGC CCGACACCAT TGCAAACGAA CAGCAAAACA TGTTGCGTGC CCTTGGCGCT GAGCTTATTA TGACACCAGC CGATGCCCTG CCCGATGAAG CACGAAGCTG CATGATGGTT GCTCACAGCC TTGAGCAAAG CATTCCGCAC GCTTTTTTTG TAGGCATGTA CGATAACCCC TTGAGCTTGC GCATTCATAG CGATATTACC GCACCTGAAT TGCTTCTCCA ATGCAACAAT GCCGTAACGC ACGTTGTGGT GCCGCTTGGC TCAGGCGCGT TAGCATTTGG CATGAGCCGT GCGCTGAAAG CTGCCAACCC CTCACTCCAT ATTATTGGGG TTGAACCCAA AGGTTCCATT TACGCCTCAC TCTTTCAGCG GGGTGAACTT TCTGCCCCAG AGCGTTGGGA TGTGGAAGAG ATGGGCGCTC GCCAACCCTC CCCATTTTGG GAACGTTCAC TTTTGGATGA TGTGGTGCAA ATAAGCGATC ACGACGCTTT TAATTGTGCG CGTGAATTGC TTCGTACAGA GGCAATATTT GCTGGTGGAG CTTCGGGCGC CGCTATGGCA GCCGCTCTTC ACCTTGCCAA GCAATGTAAT GAAGATGATT CTATTGCGGT GATGCTTACC GATTTTGGTG GTTACTCCCT CAGTCGCCTT TATTGCGACG ATTGGATGCG TAAAAAAGGT TTTTATCGCA AAGTAAAATC ATCACTTGAG CAAATTACGG CTGAAGATAT TTTGCAACGC AAAGCACGTC GCAACCTCAT TTTTGCTCAT CCCGAACATA CGCTTGCCGA AGTGTTTGAA ATGATGAAGC AAAACGATGT TTCCCAAATG CCTATTGTTT CATACAACGC ACCTATTGGC AGCATTAGCG AAAACAGGAT TCTTTCTATT CTTATTGAAC ACGATGATGC CATGAATGCT AAAGTGATTG GCTTTATGGA AAAACCTTTT CCCGTATGTT CGCCCGATGC CACCATTTCG GAATTATCGG CTCGCTTGCA ACAACATGCC TCAGGCGTTC TTGTGAATAT GTCGGATGGT AAGCTGCAAC TCCTTACAAA ATCAGATCTT ATAGATGCGC TTACCCACAA GTAA
|
Protein sequence | MQQYPTLTLT TETPLLQLRH LARTIRPTIM AKLEYMNSAY SHHFRAAQAI VQAAEEQEQI HPGMTLVDWS LGSSAIALAM VAVNRGYKLL LAVPDTIANE QQNMLRALGA ELIMTPADAL PDEARSCMMV AHSLEQSIPH AFFVGMYDNP LSLRIHSDIT APELLLQCNN AVTHVVVPLG SGALAFGMSR ALKAANPSLH IIGVEPKGSI YASLFQRGEL SAPERWDVEE MGARQPSPFW ERSLLDDVVQ ISDHDAFNCA RELLRTEAIF AGGASGAAMA AALHLAKQCN EDDSIAVMLT DFGGYSLSRL YCDDWMRKKG FYRKVKSSLE QITAEDILQR KARRNLIFAH PEHTLAEVFE MMKQNDVSQM PIVSYNAPIG SISENRILSI LIEHDDAMNA KVIGFMEKPF PVCSPDATIS ELSARLQQHA SGVLVNMSDG KLQLLTKSDL IDALTHK
|
| |