Gene Information |
Locus tag | Caul_2203 |
Symbol | |
ID | 5899658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2400514 |
End bp | 2401968 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562695 |
Product | succinylglutamic semialdehyde dehydrogenase |
Protein accession | YP_001683829 |
Protein GI | 167646166 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03240] succinylglutamic semialdehyde dehydrogenase |
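The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the terminal stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values copied from the Gene Information table for Caul_2203.
length = gene_length_bp(2400514, 2401968)
print(length)                     # 1455
print(protein_length_aa(length))  # 484
```

Both results match the Gene Length (1455 bp) and Protein Length (484 aa) rows of the record.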
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.87702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.481934 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG GCTTGTTTAT CGACGGCGTC TGGCGCGCGG GCGCCGGCGC TCAGGCGACC TCGGTTGATC CGACCACCGG CGAGGTGATC TGGCGCCAGG CGACGGCCTC GACCGCCGAG GTGGCCGCGG CGGTCGAGGC CGCCCGCAAG GCCTTTCCGG CCTGGGCCGA CCGTTCGCGT GAAGAGCGGA TCGCGGTCCT GCGTCGCTAC AAGGACGTGC TGGTCGCCCG CACGGGAACC TTCGCCGAGG CCTTGAGCCG CGAGACCGGC AAGGCGCTGT GGGAGACCAA GGCTGAGCTC GGTTCGATGG CCGGCAAGGT CGAGGCGTCG ATCAAGGCGT ACGACGAACG CACCGGCGAG CACGCCAACG ACATGGCCTT CGGTCGCGCC GTGCTGCGCC ACCGCGCCCA CGGCGTGATG GCGGTGCTGG GACCGTTCAA CTTCCCGGGC CATCTGCCCA ACGGCCATAT CGTGCCAGCT CTTCTGGCGG GCGACACCGT GGTGTTCAAG CCGTCGGAGG AGACGCCTCT AGCGGGTCAA TTGTTGGTCG AAGCCCTTGA AGAGGCGGGT GTTCCGGCCG GCGTCATCAA CCTGGTGCAG GGCGGTCGCG AGGTCGGACA GGCGCTGATC GACCAGGAGA TCGACGGCCT GCTGTTCACC GGCTCGGCCG CCGCCGGCGC CTTCTTCCGT CGCCATTTCG CCGACCGGCC GGATGTGATC CTGGCCTTGG AGCTGGGCGG CAACAATCCG CTGGTCGTCT GGGACGCCGG CGACCCCGAG GCCGTGGCGG CCCTGATCGT CCAGTCGGCC TTCATCACCA CCGGCCAGCG CTGTTCGTGC GCGCGGCGGT TGATCGTTTC CGACGATGCG GCGGGTCGGG CTGTGATCGA CGCCGTGGCG GCCCTGTCCG AGCGGCTGGT CATCGGCCCG TGGAACGGCG GGCAGGAGCC TTTCATGGGG CCGCTGATCT CCGACCGCGC GGCGGCGATG GCTCTTGCCG GCGCCAAGGC CATGCCGGGC CAGACGCTTC GCGCCATGAC GTCGGTCGAT GGGCTGAGCC GGGCCTTCGT CTCGCCGGGC CTGGTCGATG TGACCGGCGA GACCGTGCCC GACGAGGAAC TGTTCGCTCC GCTGCTGCAG GTGCGCCGGG TCGGCTCGTT CGAGGAGGCC ATCGCAGCCG CCAACGCCAC GCGTTATGGC CTGTCGGCGG GACTTGTCTC CAATGAAACA GCCCATTGGG ATCGTTTCCT GACGCGCATC CGGGCCGGTG TCGTCAACTG GAACCGGCCG ACCACGGGCG CGGCCGGGAC GATGCCGTTC GGCGGGCTAG GCAATTCGGG GAACCATCGT CCCAGCGCCT ATTACGCCGC CGACTACTGC GCCTATCCAG TGGCCAGTTT CGAGGCGGAG AACGTCACCA ATACCCTGGG CGACATCAAG GGCTTGCGCG CGTGA
|
Protein sequence | MSGGLFIDGV WRAGAGAQAT SVDPTTGEVI WRQATASTAE VAAAVEAARK AFPAWADRSR EERIAVLRRY KDVLVARTGT FAEALSRETG KALWETKAEL GSMAGKVEAS IKAYDERTGE HANDMAFGRA VLRHRAHGVM AVLGPFNFPG HLPNGHIVPA LLAGDTVVFK PSEETPLAGQ LLVEALEEAG VPAGVINLVQ GGREVGQALI DQEIDGLLFT GSAAAGAFFR RHFADRPDVI LALELGGNNP LVVWDAGDPE AVAALIVQSA FITTGQRCSC ARRLIVSDDA AGRAVIDAVA ALSERLVIGP WNGGQEPFMG PLISDRAAAM ALAGAKAMPG QTLRAMTSVD GLSRAFVSPG LVDVTGETVP DEELFAPLLQ VRRVGSFEEA IAAANATRYG LSAGLVSNET AHWDRFLTRI RAGVVNWNRP TTGAAGTMPF GGLGNSGNHR PSAYYAADYC AYPVASFEAE NVTNTLGDIK GLRA