Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4757 |
Symbol | |
ID | 5902219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5142326 |
End bp | 5143312 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641565276 |
Product | cysteine synthase A |
Protein accession | YP_001686375 |
Protein GI | 167648712 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.157844 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACG CCTCCTCGAC CTACGACGCC GCGCGCTTCA AGCGGGCCGG ACGCGGCAAG ATCTTCGACT CGATCCTCGA CACGGTCGGC GACACGCCGC TGGTCCGCCT GCCCAACCTG ACCGCCGAGC TGAAGCCCAA GGGCACGGTC GTGGCCAAGC TGGAGTTCTT CAACCCGCTG GCCTCGGTCA AGGACCGCAT CGGCGTGGCG ATGATCGAGT ATCTGGAGGC GAAGGGGCTC TTGAAGCCGG GCGGCGTCAT CGTCGAGCCG ACCAGCGGCA ACACCGGCAT CGCCCTGGCC TTCGTGGCCG CGTCCAAGGG CTACAAGCTG ACCCTGGTCA TGCCCGAGAG CATGTCGATC GAGCGTCGCA AGATGCTGCT GCTGCTCGGC GCCAAGCTGG AGCTGACCCC CGCCGAGAAG GGCATGCGCG GCGCCGTGGC CCGCGCCCAG GAGATCATCG AGGCCACCCC CGGCGCGGTG ATGCCCCAGC AGTTCGAAAA CACCGCCAAC CCCCTGATCC ACCGGGTCTC GACCGCCGAG GAAATCTGGA ACGACACGGC CGGCGCCGTG GACGCCGTGG TCTCGGGCGT CGGCACCGGC GGCACCATCT CGGGCGTCGG CCAGGTGCTG AAGGCCCGCA AGCCGTCGGT GAAGATGGTG GCGGTCGAGC CGGAGGCCTC CGCCGTGCTG TCGGGCGGCG CGCCCGGCCC GCACAAGATC CAGGGCATCG GCGCCGGCTT CGTGCCGGGC ATCCTCGATC GCGGGGTGAT CGACGAGATC ATCCAGGTCA GCAATGACGA CAGCTTCGCC ATGGCCCGCA AGGCGGCCTC GACCGAGGGC CTGCCGGTGG GCATCAGCTC GGGCGCGGCC CTGACCGCCG CCTTCGACCT GGCCATGCGC GACGAATACG CCGGCAAGCT GATCGTGGCG ATCATCCCCA GCTTCGCCGA ACGCTACCTG TCGACGGCGC TGTTCGAAGG TCTGTGA
|
Protein sequence | MDDASSTYDA ARFKRAGRGK IFDSILDTVG DTPLVRLPNL TAELKPKGTV VAKLEFFNPL ASVKDRIGVA MIEYLEAKGL LKPGGVIVEP TSGNTGIALA FVAASKGYKL TLVMPESMSI ERRKMLLLLG AKLELTPAEK GMRGAVARAQ EIIEATPGAV MPQQFENTAN PLIHRVSTAE EIWNDTAGAV DAVVSGVGTG GTISGVGQVL KARKPSVKMV AVEPEASAVL SGGAPGPHKI QGIGAGFVPG ILDRGVIDEI IQVSNDDSFA MARKAASTEG LPVGISSGAA LTAAFDLAMR DEYAGKLIVA IIPSFAERYL STALFEGL
|
| |