Gene Caul_3617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3617 
Symbol 
ID5901072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3903614 
End bp3905152 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content73% 
IMG OID641564128 
Productaldehyde dehydrogenase 
Protein accessionYP_001685242 
Protein GI167647579 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAG CCGCCCCCCA AGCCGCTTTG CGGGCCCTCA ACCCCGCCAC CAACGAACAC 
TTTGGTCCCA GCTTCCCAGA GCCCAGCGCC GCCCAGATCG AGGCGGCCTG CGCCGCCGCC
GCGGCCGCGT TCGACGCCTA TCGCGAGACC GACCTGGAAA CCCGCGCGGC CTTCCTCGAG
GGAATCGCCA CCGAGATCGA GGCCCTGGGC GACGCGTTGA TCCAGACCGC CATGGCCGAG
ACCGGCCTGC CCCAGGCCCG CATCACCGGC GAGCGCGGCC GCACCTGCGG CCAGCTGCGC
CTGTTCGCCC AGGTCGTGCG CCGCGGCGAC TGGATCGGCG CGCGGATCGA CCCGGCCATG
CCCGAGCGCA CGCCCCTGCC CCGCGCCGAC CTGCGCCAGC GCTTCATCCC GCTGGGTCCG
GTCGTGGTGT TCGGAGCCAG CAACTTCCCA CTGGCCTTCT CGACGGCCGG CGGCGACACC
GCCTCGGCCC TGGCGGCCGG TTGCCCGGTG ATCGTCAAGG GCCACTCGGC CCACCCCAAC
ACCGGCGCGA TGATCGGCGG CGCGATCGAC AAGGCGGTCA AGGCCGCCGG CCTGCCCGCC
GGGGTCTTCG CCATCCTGAT CGGCCAGCAG CGCACCCTGG GCGCCGGCCT GGTCGCCGAT
CCGCGCATCA AGGCCGTGGG CTTCACCGGT TCTCGCGCCG GCGGCGTCGC CTTCATGCGG
ATCGCCGCGG GGCGTCCCGA GCCGATCCCG GTCTTCGCCG AGATGAGCAG CATCAACCCG
GTGGTCCTCA TGCCCGCCGC CCTGGCCGCC CGGGCCGAGG CCCTGGGGAC GGCCTTCGTC
GGCTCGCTGA CGATGGGCGC GGGCCAGTTC TGCACCAATC CCGGCCTGGT CTTCGCCCTG
GGCGGCCCCG ACCTGGATCG TTTCGAAGCC GCCGCCGTCG CCGCCCTGAC CGCCGCCCAG
CCGCAGGTCA TGCTGACGCC CGGCATCTTC GGCGCCTATG AGCAGGGGGT GAACCAATTG
CTCGACCGCG ACGGCGTCAC GCTGCTGGCG CGCGGCTGCG TCGGCGACGG CGTCAACCAG
GCGGTCGGCG CGCTGTTCTC GGTCGATGTC GAGACCTTCC AGCGCGACGC GGTGCTGAGC
CATGAGGTGT TCGGCTCGTC GTCGCTGATC GTGCGGGTGT CGGACGCCGC CCAACTGGCC
GGCGCGTTGG AAGGGCTGGA GGGCCAACTG ACCGCCACCC TGCAGATGGA TCCCGCCGAC
GCCGAGGCCG CGCGCGGCCT GATGCCGATC CTGGAGCGCA AGGCCGGCCG CATCCTGGCC
AATGGCTGGC CGACCGGGGT CGAGGTCTCG CACGCCATGG TCCACGGCGG CCCGTTCCCG
GCCACGTCCG ACCCGCGCGG AACGTCGGTG GGCACGCGGG CCATCGAGCG GTTCCTGCGG
CCGGTCTGCT ACCAGGACAT CCCCGATACG CTGCTGCCGC CAGCCCTGAA GGCGGACAAT
CCGCTGGGCG TGCGGCGGGC CGTGGACGGG GTGCTGTAA
 
Protein sequence
MAEAAPQAAL RALNPATNEH FGPSFPEPSA AQIEAACAAA AAAFDAYRET DLETRAAFLE 
GIATEIEALG DALIQTAMAE TGLPQARITG ERGRTCGQLR LFAQVVRRGD WIGARIDPAM
PERTPLPRAD LRQRFIPLGP VVVFGASNFP LAFSTAGGDT ASALAAGCPV IVKGHSAHPN
TGAMIGGAID KAVKAAGLPA GVFAILIGQQ RTLGAGLVAD PRIKAVGFTG SRAGGVAFMR
IAAGRPEPIP VFAEMSSINP VVLMPAALAA RAEALGTAFV GSLTMGAGQF CTNPGLVFAL
GGPDLDRFEA AAVAALTAAQ PQVMLTPGIF GAYEQGVNQL LDRDGVTLLA RGCVGDGVNQ
AVGALFSVDV ETFQRDAVLS HEVFGSSSLI VRVSDAAQLA GALEGLEGQL TATLQMDPAD
AEAARGLMPI LERKAGRILA NGWPTGVEVS HAMVHGGPFP ATSDPRGTSV GTRAIERFLR
PVCYQDIPDT LLPPALKADN PLGVRRAVDG VL