Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0494 |
Symbol | |
ID | 5897949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 536249 |
End bp | 537364 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641560977 |
Product | saccharopine dehydrogenase |
Protein accession | YP_001682126 |
Protein GI | 167644463 |
COG category | [S] Function unknown |
COG ID | [COG3268] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.961307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTTC ATGGGCGATG GCATAGGGGC GGCTCGAGCA GGGGAGAGAC CATGACCGCC GTCTGGATAC TGGGCGCCAC GGGCCGCACC GGCAGCGTGA TCGCGACGAA CCTGGCCGCC GCCGGGGTCG GGTTGGTCCT TGTCGGGCGA GATGGCCCCG CTCTGCAACA TTTGGCGGAC AAGATCGGCG GAAATCCAAG GGTCCTTGCG ACTGCCAGCC TTGAGAAGAT CAAGACCGAA CTCGACGGAG CCGGCCCGAC CGTGGTCGTC AACCTCATCG GGCCATTCGC TGAAACGGCG CTACCTTTCA TAAAGGCATG CGCCCCCGGC AGCGGGTATC TTGATCTCTC CAATGACCGC GCCGCGACAG CAGCAATTCT CGATCTGGAT CAAAAGGCCC GGACAACCGG CCGATGCCTG GTCAGCGGTG CGGGCTGGGG CGTGCTCGCA GCGGAGAGCA CGGTGCTCAT GCTCTGCAAG GATCGACCGC CCGCAGCGCG GGTGAGAGTC GATCTGGCGC CTTTCATCAA CGCGTCCGGC CGGATCGGCG AAACGTTCGC CGCCACCCTG GTCGAGGCGA TGGCCGTCGG CGCGCAAATC TATGAAGACG GTCGGCTGAC TCGAGCCCGC ATCGGAGACC GGAGCGAAAC CCTGATCGCC CCCGACGGAT CGAAAATCCG GACGGGCGTG GTTTCCAGTG GCGATCTGGA GGCGGCCCGA CGTGCGAGCG GCGCGGCCTT CGCGGTGGCC GCCTCGACCC TGGCGCCCAG CTCGGGGGGC GCACGCGCGG CGATGTCCGC GATTGTGTTC CTGCTCGGCT TCCGCAGCGC CCGAGAAGTC GCGAAACGGC TCTTGGCCAA TGTCGTCGCG CCGCCCGCCA AGGGGGCGCC CAAATCATCC TGGGCTCATG CCAGGGTCGA GTGGGCGGAC GGAACGATGC GCGAGACCTG GCTCCGAGCG GGCGAGGGCA TGGCTTTCAC GTGCAAGGTC GCCACCGAGG TCGCTCTTCG GCTTTCACGC GGCGAGGGGC GGCCGGGGGC CTTCACGCCA GCCGCGCTAT TTGGGCCGGA ACTGGCCGAG GCGGCCGGAG CGAAATTTAT CGGTGAGCGA AGGTGA
|
Protein sequence | MALHGRWHRG GSSRGETMTA VWILGATGRT GSVIATNLAA AGVGLVLVGR DGPALQHLAD KIGGNPRVLA TASLEKIKTE LDGAGPTVVV NLIGPFAETA LPFIKACAPG SGYLDLSNDR AATAAILDLD QKARTTGRCL VSGAGWGVLA AESTVLMLCK DRPPAARVRV DLAPFINASG RIGETFAATL VEAMAVGAQI YEDGRLTRAR IGDRSETLIA PDGSKIRTGV VSSGDLEAAR RASGAAFAVA ASTLAPSSGG ARAAMSAIVF LLGFRSAREV AKRLLANVVA PPAKGAPKSS WAHARVEWAD GTMRETWLRA GEGMAFTCKV ATEVALRLSR GEGRPGAFTP AALFGPELAE AAGAKFIGER R
|
| |