Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0457 |
Symbol | |
ID | 5897914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 499355 |
End bp | 500824 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641560943 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001682092 |
Protein GI | 167644429 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTG GCGCCCAGAA TTTGCTGATC GGCGGGCGAT GGACCGCAGC GGAGAACGGA GGGACCTACG AATCCCTCAA CCCGTTCACC GGGCAGGTGG CGACGCGGGC GGCGGCGGCG ACGGTCGACG ACGTCGACCA GGCTGTCGCG GCGGCGCATG AAGCCTTCGG CCCGTGGGCC GCGCTGGCGC CAAGCGAGCG CCGGCGATAC CTCCTGACGG CCGCGGACGC CATCGAAGCG CGGCTTGAGG AACTCTGCGA GGCTGTCACG GCTGAGATGG GCGGACCGGC CGCTTGGGGC AAGTATAACG TCAAGGTCCT TGCCGAGAAG ATTCGCTACG CAGCTGGCGC AGCATACCAG GGGCTAACGG GAGAGGCGAT CCCCTCGGAC AATTCGGCTC GCACCATGGT CGCCATTCGC AAGCCCGCCG GCGTGGTTGC GAGCATCGTA CCGTGGAATG CGCCTGTCCT GTTGGTCGGG GCGTCGGTCC CGGCAGCGTT GGTGCTTGGC AACACGGTCG TCATCAAGGC GTCCGAGCAA ACGCCGCGCA CGCATGGGCT TGTCGCCGCG TGCTTCGAGG ACGCTGGCTT TCCCGCTGGC GTTGTCAACC TCATCACCAA CGCTCCCGAA GATGCCAGCG GGGTGGTCGA GGCGCTGATT GCTCATCCTC TTGTGCGTCG GGTGCATTTC ACCGGTTCGA CGCGTGTCGG CCGGATCATC GCAGAGAAAT GCGCCGCCTA CCTCAAGAAG GTGGTGCTGG AGCTCGGCGG CAAGGCGCCG TGCATCGTTC TGGCCGATGC CGATCTCGAT CGAGCGGTAA GCGCAGCAGC GTTCGGCTCC TTTGCCAATT CTGGTCAGGG ATGCCTGTCG ACCGAGCGCA TCATTGTAGA CAAATCGATT GCGGAGGAAT TCTGCGAGCG ACTTGCCGCC GTGGCCAGGA AAGTCACATG CGGCGATCCA CGCATGAGCG ACACCGTGCT TGGCCCGGTG ATCAATGATG CTTCCGTCAA GCGGCTGAAG GAATTGGTCG ATGATGCCGT CGTGTCCGGC GCGCGGCTGC TTGTAGGCGG CACGGCCGAG GGGCGTTGTT TTGCGCCCAC GGTATTGAGC GGAGTGACCC CCGCGATGCG GGTGTATCAG GAGGAGTCCT TTGGGCCTCT CGCTTCGGTC GTGTTCGTCG ACGGCCCGGA AGAGGCGCTG CGGGTGGCGA ATGACAACGA CTATGGATTG TCATCCGCGA TCTTCAGTCG CGACGTGGCG TCGGCTCTCG AGCTGGCGAA GCGCCTCGAC GTCGGAATGT GTCACATCAA CGGCACGACG CTCGATGATG AGGCGCAGAT CCCCTTCGGC GGCGTCAAGG ACAGCGGATA CGGCCGGTCC GGCGGCACAA TCGGGATGGA AGAACTGACC GAGGTGCAGT GGATCACGAT CGAAGGCCCC AAAGCGCCGC GTTACCCGAT CGCGGAATAG
|
Protein sequence | MDIGAQNLLI GGRWTAAENG GTYESLNPFT GQVATRAAAA TVDDVDQAVA AAHEAFGPWA ALAPSERRRY LLTAADAIEA RLEELCEAVT AEMGGPAAWG KYNVKVLAEK IRYAAGAAYQ GLTGEAIPSD NSARTMVAIR KPAGVVASIV PWNAPVLLVG ASVPAALVLG NTVVIKASEQ TPRTHGLVAA CFEDAGFPAG VVNLITNAPE DASGVVEALI AHPLVRRVHF TGSTRVGRII AEKCAAYLKK VVLELGGKAP CIVLADADLD RAVSAAAFGS FANSGQGCLS TERIIVDKSI AEEFCERLAA VARKVTCGDP RMSDTVLGPV INDASVKRLK ELVDDAVVSG ARLLVGGTAE GRCFAPTVLS GVTPAMRVYQ EESFGPLASV VFVDGPEEAL RVANDNDYGL SSAIFSRDVA SALELAKRLD VGMCHINGTT LDDEAQIPFG GVKDSGYGRS GGTIGMEELT EVQWITIEGP KAPRYPIAE
|
| |