Gene Caul_2747 details


Gene Information

Locus tag: Caul_2747
Symbol:
ID: 5900202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2984288
End bp: 2985820
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 66%
IMG OID: 641563239
Product: aldehyde dehydrogenase
Protein accession: YP_001684372
Protein GI: 167646709
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
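
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly. It also assumes the final codon is a stop codon, which matches the TAG at the end of the gene sequence listed in this record.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 2984288
end_bp = 2985820
gene_length_bp = 1533
protein_length_aa = 510

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not translated.
coding_codons = codons - 1

print(span, codons, remainder, coding_codons)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.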


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.120582
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0601156
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGCATCG CCCAGGTCGG CTTCATGGTG ATCGCTGATA AGATGAGCCG CCAGGCCGCT 
TGGGAGGACC TAACCTTGAA CGCGCCCGCC AATCTCAACC TTGACCTGCA CAAAGCCTCG
CTCTCGAAAA TTCTCGAAGC GCAGAAGGCC GCGCACCTGC GACAAGGCGC CCCAACGGCG
GCCCGTCGCA TCGATTGGCT GGACCGTTGC ATCGGCCTGC TGGTCGATCA CCAGGTCGAG
ATCGCCGACG CCCTGAACAC CGATTTTGGC GCGCGCTCCA AGGACGCCAC GGGCCTGACC
GACATCGCCG GCTCGATCGG CCCGCTGAAG TACGCCAAGG AGAATGTCGC CAAGTGGATG
CGCCCCGAAA AGCGCAAGAC CACCCCGGCG ATCCTGGGCC TGTTCGGCGC CAAGGCCGAG
GTCCACTATC AGCCCAAGGG CGTGGTCGGG GTGATCAGCC CGTGGAACTT CCCGGTCAAC
CTGACCTTCG CGCCGCTGGC CGGCGTGTTG GCGGCCGGCA ACCGGGCGAT GATCAAGCCG
TCCGAGTTCA CGCCGATCAC CTCCGAACTG ATGAAGACGA TGTTCGCCAA GGCCTTTTCC
GAGGAGGAGA TCGCGGTGAT CACTGGCGGC CCCGACGTGG GCCAGGCCTT CACCAGCCTG
CCGTTCGACC ACCTGGTCTT CACCGGCGCG ACGTCGGTGG CGCGTCATGT GATGCGAGCG
GCGGCCGAGA ACCTGGTGCC GGTGACCCTG GAGCTGGGCG GCAAGAGCCC GGTGATTCTG
TCGCGGGGGG CCGACATGGC CACGGCGGCG GCGCGGATCA TGAACGGCAA GACCCTCAAC
GCCGGCCAGA TCTGCCTGGC GCCCGACTAT GTGCTGGCGC CGGCCGACCA GATCGACAGC
TTCGTCGCCG AGGCCAAGGC CGCCGTGGCG CGGACCTTCC CGACCCTCAA GGACAATCCC
GACTATACGG CCGTGGTCGC CCAGCGCCAC TATGACCGGA TCAAGGGCCA TGTGGACGAC
GCCCGGGCCA AGGGCGCGAC GATCATCGAG ATCAACCCGG CCGGCGAGGA TCTGAGCCAG
CAGGAGCATC GCAAGATCGC CCCGACCCTG ATCCTCAACC CGACCGACGA CATGACGGTG
ATGCAGGACG AGATCTTCGG TCCCGTCCTG CCGGTGAAGA CCTACGGCAA GGTCGAGGAG
GCGGTAAACT ATATCAACGC CCACGATCGG CCCCTGGGGC TCTACTGGTT CGGGACCGAC
GACGCCGAGC GCGACATGGT GCTGAACCGC ACGACCAGCG GCGGGGTGAC GGTCAATGAC
GTGATTTTCC ACGTCGCCCA GGAGGATCTG CCGTTCGGCG GCGTCGGGCC GGCCGGCATG
GGCTCGTACC ATGGCCGTGA CGGCTTCATG GAGTTCAGCC ACCGCAAGGC GGTGTTCCAT
CAGCTGAAGA AGGACATCGC GCCCATGCTG GCCCTGCGGC CGCCCTATGG CGCGGGGATC
CGCAAGTATC TGGCGAGTCA GATCAAGAAG TAG
 
Protein sequence
MRIAQVGFMV IADKMSRQAA WEDLTLNAPA NLNLDLHKAS LSKILEAQKA AHLRQGAPTA 
ARRIDWLDRC IGLLVDHQVE IADALNTDFG ARSKDATGLT DIAGSIGPLK YAKENVAKWM
RPEKRKTTPA ILGLFGAKAE VHYQPKGVVG VISPWNFPVN LTFAPLAGVL AAGNRAMIKP
SEFTPITSEL MKTMFAKAFS EEEIAVITGG PDVGQAFTSL PFDHLVFTGA TSVARHVMRA
AAENLVPVTL ELGGKSPVIL SRGADMATAA ARIMNGKTLN AGQICLAPDY VLAPADQIDS
FVAEAKAAVA RTFPTLKDNP DYTAVVAQRH YDRIKGHVDD ARAKGATIIE INPAGEDLSQ
QEHRKIAPTL ILNPTDDMTV MQDEIFGPVL PVKTYGKVEE AVNYINAHDR PLGLYWFGTD
DAERDMVLNR TTSGGVTVND VIFHVAQEDL PFGGVGPAGM GSYHGRDGFM EFSHRKAVFH
QLKKDIAPML ALRPPYGAGI RKYLASQIKK