Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2280 |
Symbol | |
ID | 4568696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2612345 |
End bp | 2613724 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766842 |
Product | cysteine synthase |
Protein accession | YP_912696 |
Protein GI | 119358052 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG0031] Cysteine synthase [COG3620] Predicted transcriptional regulator with C-terminal CBS domains |
TIGRFAM ID | [TIGR01137] cystathionine beta-synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAATC ACGATATCCT TGCTCTTGGA GCTGAAAGTC CTCTTGTGCT CATAAAGCAA CTTGCCCGGC AGATCAAGCC GAAAGTTATG GCAAAACTGG AGTATATGAA TCCGGCATGC TCGCACTACT ATCGGGTTGC TTCAGCCATT ATTCTTGATG CAGAGCAGCG AAACCTCATT CACCCCGGGA TGACTCTTGT TGACTGGACA TATGGCGACA GCGGTATTGC CCTTGCCATG GCCGGTTTGA GGAAGGGATA CAATCTCCTG CTTGTTGCTC CGGATAAAAT CTCACGTGAA AAACAGGAGT TGCTCAAGGC GCTTGGAGCG GAGACGGTTA TTACCCCATC GGCGGCTCTT CCGGGAGAAC CGCGTAGTTG TATGAATGTT GCGGAAAGTC TTGTGAAAAA AATTTCCAAT GCTTTTTTTG CCAACATGTA TGAAAATCCT ATAGGCATGG AGGTGCATCG TCAGTTCACT TCGAGGGAGA TTTTCGAACA GACTGACGGA GCCGTTACCC ATGTGTTTGT GCCCATGGTT TCAGGCGCGA TGATTTCCGG GATCGGATCT TTTTTCAAGC AGAAAAAACC TTCGGTCAGG ATTATCGGCG TCGAGCCTGA GGGTTCCATC TATTCAGGTC TTTTCAGGGA AGGGACGCCT GGAAAAACCG GTTTTTATGA ACTTGAAGAG ATAGGAGCGG TCGCTCCTTC AGGGTTGTGG GATTCTTCAT TCATTGATGA CATTGTGCAG GTAAGTGATT ACGATGCGTT TAATTGCGGG CGTGAACTGC TCAGGGCTGA ATCAGTATTC GCCGGCGGCT CCTCTGGCGC TGTTATGGCG GCGGTACTGC GTGCCGGACG GCACTACACT GAAGATGATT GTGTTGTTGC CCTCATGAAC GATTTCGGAG GTTTTTACCT CAGCAAGATG TATCGTGATG AATGGATGAA AGCGAAGGGG CTGTATCGTA AGGCGAAATC GGCACTTGAC CAGATTACCG CAGAAGATAT TCTTCTGCTG AAAGGAAGCA AGGATCTTAT TTTCGCTTAT CCCGAGAACA CGCTGGCCGA AGTTTTTGAG ATGATGAAGC AGAGCGATGT TTCGCAGCTT CCCATTGTTT CCTATGGAAC TCCGATAGGC AGCATCAGCG AAAACAGGAT TTTATCGATT CTTATTGAAA ATGATGAGGC GATGAACTCG AAGGTTGTGG GTTTCATGGA GCAGCCTTTC GCCGTTTGTC AGCCGGGAGC AACCATATCG GATCTCTCGT CCAAGCTTCA GGAAAGTGCT TCAGGAGTAC TTATTGCGCT GTCAGACGGG CGCTTGCAAT TGCTCACGAA ATCGGATCTT ATTGACGCTT TGACTCACAA GCAGGGTTGA
|
Protein sequence | MSNHDILALG AESPLVLIKQ LARQIKPKVM AKLEYMNPAC SHYYRVASAI ILDAEQRNLI HPGMTLVDWT YGDSGIALAM AGLRKGYNLL LVAPDKISRE KQELLKALGA ETVITPSAAL PGEPRSCMNV AESLVKKISN AFFANMYENP IGMEVHRQFT SREIFEQTDG AVTHVFVPMV SGAMISGIGS FFKQKKPSVR IIGVEPEGSI YSGLFREGTP GKTGFYELEE IGAVAPSGLW DSSFIDDIVQ VSDYDAFNCG RELLRAESVF AGGSSGAVMA AVLRAGRHYT EDDCVVALMN DFGGFYLSKM YRDEWMKAKG LYRKAKSALD QITAEDILLL KGSKDLIFAY PENTLAEVFE MMKQSDVSQL PIVSYGTPIG SISENRILSI LIENDEAMNS KVVGFMEQPF AVCQPGATIS DLSSKLQESA SGVLIALSDG RLQLLTKSDL IDALTHKQG
|
| |