Gene Caul_0457 details

Gene Information

Locus tag: Caul_0457
Symbol:
ID: 5897914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 499355
End bp: 500824
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 64%
IMG OID: 641560943
Product: aldehyde dehydrogenase
Protein accession: YP_001682092
Protein GI: 167644429
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
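
The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive (the reported gene length fits that convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 499355, End bp 500824.
length = gene_length(499355, 500824)
print(length, protein_length(length))  # prints "1470 489"
```

Both results match the Gene Length and Protein Length fields in the record.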


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATTG GCGCCCAGAA TTTGCTGATC GGCGGGCGAT GGACCGCAGC GGAGAACGGA 
GGGACCTACG AATCCCTCAA CCCGTTCACC GGGCAGGTGG CGACGCGGGC GGCGGCGGCG
ACGGTCGACG ACGTCGACCA GGCTGTCGCG GCGGCGCATG AAGCCTTCGG CCCGTGGGCC
GCGCTGGCGC CAAGCGAGCG CCGGCGATAC CTCCTGACGG CCGCGGACGC CATCGAAGCG
CGGCTTGAGG AACTCTGCGA GGCTGTCACG GCTGAGATGG GCGGACCGGC CGCTTGGGGC
AAGTATAACG TCAAGGTCCT TGCCGAGAAG ATTCGCTACG CAGCTGGCGC AGCATACCAG
GGGCTAACGG GAGAGGCGAT CCCCTCGGAC AATTCGGCTC GCACCATGGT CGCCATTCGC
AAGCCCGCCG GCGTGGTTGC GAGCATCGTA CCGTGGAATG CGCCTGTCCT GTTGGTCGGG
GCGTCGGTCC CGGCAGCGTT GGTGCTTGGC AACACGGTCG TCATCAAGGC GTCCGAGCAA
ACGCCGCGCA CGCATGGGCT TGTCGCCGCG TGCTTCGAGG ACGCTGGCTT TCCCGCTGGC
GTTGTCAACC TCATCACCAA CGCTCCCGAA GATGCCAGCG GGGTGGTCGA GGCGCTGATT
GCTCATCCTC TTGTGCGTCG GGTGCATTTC ACCGGTTCGA CGCGTGTCGG CCGGATCATC
GCAGAGAAAT GCGCCGCCTA CCTCAAGAAG GTGGTGCTGG AGCTCGGCGG CAAGGCGCCG
TGCATCGTTC TGGCCGATGC CGATCTCGAT CGAGCGGTAA GCGCAGCAGC GTTCGGCTCC
TTTGCCAATT CTGGTCAGGG ATGCCTGTCG ACCGAGCGCA TCATTGTAGA CAAATCGATT
GCGGAGGAAT TCTGCGAGCG ACTTGCCGCC GTGGCCAGGA AAGTCACATG CGGCGATCCA
CGCATGAGCG ACACCGTGCT TGGCCCGGTG ATCAATGATG CTTCCGTCAA GCGGCTGAAG
GAATTGGTCG ATGATGCCGT CGTGTCCGGC GCGCGGCTGC TTGTAGGCGG CACGGCCGAG
GGGCGTTGTT TTGCGCCCAC GGTATTGAGC GGAGTGACCC CCGCGATGCG GGTGTATCAG
GAGGAGTCCT TTGGGCCTCT CGCTTCGGTC GTGTTCGTCG ACGGCCCGGA AGAGGCGCTG
CGGGTGGCGA ATGACAACGA CTATGGATTG TCATCCGCGA TCTTCAGTCG CGACGTGGCG
TCGGCTCTCG AGCTGGCGAA GCGCCTCGAC GTCGGAATGT GTCACATCAA CGGCACGACG
CTCGATGATG AGGCGCAGAT CCCCTTCGGC GGCGTCAAGG ACAGCGGATA CGGCCGGTCC
GGCGGCACAA TCGGGATGGA AGAACTGACC GAGGTGCAGT GGATCACGAT CGAAGGCCCC
AAAGCGCCGC GTTACCCGAT CGCGGAATAG
 
Protein sequence
MDIGAQNLLI GGRWTAAENG GTYESLNPFT GQVATRAAAA TVDDVDQAVA AAHEAFGPWA 
ALAPSERRRY LLTAADAIEA RLEELCEAVT AEMGGPAAWG KYNVKVLAEK IRYAAGAAYQ
GLTGEAIPSD NSARTMVAIR KPAGVVASIV PWNAPVLLVG ASVPAALVLG NTVVIKASEQ
TPRTHGLVAA CFEDAGFPAG VVNLITNAPE DASGVVEALI AHPLVRRVHF TGSTRVGRII
AEKCAAYLKK VVLELGGKAP CIVLADADLD RAVSAAAFGS FANSGQGCLS TERIIVDKSI
AEEFCERLAA VARKVTCGDP RMSDTVLGPV INDASVKRLK ELVDDAVVSG ARLLVGGTAE
GRCFAPTVLS GVTPAMRVYQ EESFGPLASV VFVDGPEEAL RVANDNDYGL SSAIFSRDVA
SALELAKRLD VGMCHINGTT LDDEAQIPFG GVKDSGYGRS GGTIGMEELT EVQWITIEGP
KAPRYPIAE