Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4440 |
Symbol | |
ID | 5901901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4806150 |
End bp | 4807355 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641564958 |
Product | saccharopine dehydrogenase |
Protein accession | YP_001686058 |
Protein GI | 167648395 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1748] Saccharopine dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGG TGCTGGTGAT CGGCGCTGGC GGCGTCGGTT CGGTCGCGGT CCATAAGATG GCGATGAACA CGGACGTGTT TTCGCACATC ACTTTGGCCA GCCGCACGAA GTCGAAGTGC GACGCGATCG CGCAATCCGT GAAGCAGCGA ACCGGCGTGA CCATCGACAC GGCCGCGCTC GACGCCGACG ACGTCGCCGC GACCACGGCG CTGATCCAGG CGGTCAAGCC AGAGCTGGTG GTCAATCTGG CGCTGCCCTA TCAGGATCTG AACATCATGG ACGCCTGTCT GGCGACCGGG GTGAACTATC TCGATACGGC CAACTACGAG CCGCGCGACG AGGCCAAGTT CGAATATAGC TGGCAGTGGG CCTATCAGGA CCGCTTCAAG GAGGCCGGCC TGATGGCCCT GCTGGGCAGC GGCTTCGACC CCGGCGTGAC CTCGGTGTTC ACCACCTACA CCAAGAAGCA CCTGCTGGAC CGGATCGACA CGCTCGACAT CCTGGACTGC AACGGCGGCG ATACCGGCCT GCCCTTCGCC ACCAACTTCA ATCCCGAGAT CAACCTGCGC GAAGTGACCG CGCCCTCGCG GCACTGGGAA AACGGCCAGT GGATCGAGGG GCCGGCGCTG AGCCACAAGC AGGTGTTCGA CTTCGACCAG GTTGGGCCGA AGAACATGTA CCTCATGTAT CATGAGGAGC TGGAATCCCT GGCCAAGTTC TATCCGGAGA TCCAGCGCAT CCGCTTCTGG ATGACGTTCG GCGACTCCTA TCTCAAGCAC CTGGAGGTGC TGGAGAACAT CGGCATGACC CGCATCGAGC CGATGATGTT CCAGGGGCGC GAGATCATCC CCATCGAGTT CCTCAAGGCC CTGCTGCCCG AGCCGTCGTC GCTGGGTCCG ATCACCAAGG GCAAGACCAA TATCGGCACG ATCGCTACGG GCCAGAAGGA CGGCCAGGCC CGGACGGTCT ACGTCAACAA CGTGTGCGAC CACGAGGCCG CCTATGCCGA GACCGGCAAC CAGGCCGTCA GCTACACGAC AGGCGTCCCG GCCATGATCG GCGCGGCCCT GATGATGACC GGCCAATGGA AGGGCGCGGG CGTGTTCAAC ATGGAGCAGC TGGACCCCGA TCCGTTCATG GACATGCTGA ACAAGCACGG CCTGCCCTGG CAGGTTCGCG ACCTCGACGC CCCGCTGGAC TTCTGA
|
Protein sequence | MGKVLVIGAG GVGSVAVHKM AMNTDVFSHI TLASRTKSKC DAIAQSVKQR TGVTIDTAAL DADDVAATTA LIQAVKPELV VNLALPYQDL NIMDACLATG VNYLDTANYE PRDEAKFEYS WQWAYQDRFK EAGLMALLGS GFDPGVTSVF TTYTKKHLLD RIDTLDILDC NGGDTGLPFA TNFNPEINLR EVTAPSRHWE NGQWIEGPAL SHKQVFDFDQ VGPKNMYLMY HEELESLAKF YPEIQRIRFW MTFGDSYLKH LEVLENIGMT RIEPMMFQGR EIIPIEFLKA LLPEPSSLGP ITKGKTNIGT IATGQKDGQA RTVYVNNVCD HEAAYAETGN QAVSYTTGVP AMIGAALMMT GQWKGAGVFN MEQLDPDPFM DMLNKHGLPW QVRDLDAPLD F
|
| |