Gene Caul_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2203 
Symbol 
ID5899658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2400514 
End bp2401968 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content69% 
IMG OID641562695 
Productsuccinylglutamic semialdehyde dehydrogenase 
Protein accessionYP_001683829 
Protein GI167646166 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03240] succinylglutamic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.87702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.481934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG GCTTGTTTAT CGACGGCGTC TGGCGCGCGG GCGCCGGCGC TCAGGCGACC 
TCGGTTGATC CGACCACCGG CGAGGTGATC TGGCGCCAGG CGACGGCCTC GACCGCCGAG
GTGGCCGCGG CGGTCGAGGC CGCCCGCAAG GCCTTTCCGG CCTGGGCCGA CCGTTCGCGT
GAAGAGCGGA TCGCGGTCCT GCGTCGCTAC AAGGACGTGC TGGTCGCCCG CACGGGAACC
TTCGCCGAGG CCTTGAGCCG CGAGACCGGC AAGGCGCTGT GGGAGACCAA GGCTGAGCTC
GGTTCGATGG CCGGCAAGGT CGAGGCGTCG ATCAAGGCGT ACGACGAACG CACCGGCGAG
CACGCCAACG ACATGGCCTT CGGTCGCGCC GTGCTGCGCC ACCGCGCCCA CGGCGTGATG
GCGGTGCTGG GACCGTTCAA CTTCCCGGGC CATCTGCCCA ACGGCCATAT CGTGCCAGCT
CTTCTGGCGG GCGACACCGT GGTGTTCAAG CCGTCGGAGG AGACGCCTCT AGCGGGTCAA
TTGTTGGTCG AAGCCCTTGA AGAGGCGGGT GTTCCGGCCG GCGTCATCAA CCTGGTGCAG
GGCGGTCGCG AGGTCGGACA GGCGCTGATC GACCAGGAGA TCGACGGCCT GCTGTTCACC
GGCTCGGCCG CCGCCGGCGC CTTCTTCCGT CGCCATTTCG CCGACCGGCC GGATGTGATC
CTGGCCTTGG AGCTGGGCGG CAACAATCCG CTGGTCGTCT GGGACGCCGG CGACCCCGAG
GCCGTGGCGG CCCTGATCGT CCAGTCGGCC TTCATCACCA CCGGCCAGCG CTGTTCGTGC
GCGCGGCGGT TGATCGTTTC CGACGATGCG GCGGGTCGGG CTGTGATCGA CGCCGTGGCG
GCCCTGTCCG AGCGGCTGGT CATCGGCCCG TGGAACGGCG GGCAGGAGCC TTTCATGGGG
CCGCTGATCT CCGACCGCGC GGCGGCGATG GCTCTTGCCG GCGCCAAGGC CATGCCGGGC
CAGACGCTTC GCGCCATGAC GTCGGTCGAT GGGCTGAGCC GGGCCTTCGT CTCGCCGGGC
CTGGTCGATG TGACCGGCGA GACCGTGCCC GACGAGGAAC TGTTCGCTCC GCTGCTGCAG
GTGCGCCGGG TCGGCTCGTT CGAGGAGGCC ATCGCAGCCG CCAACGCCAC GCGTTATGGC
CTGTCGGCGG GACTTGTCTC CAATGAAACA GCCCATTGGG ATCGTTTCCT GACGCGCATC
CGGGCCGGTG TCGTCAACTG GAACCGGCCG ACCACGGGCG CGGCCGGGAC GATGCCGTTC
GGCGGGCTAG GCAATTCGGG GAACCATCGT CCCAGCGCCT ATTACGCCGC CGACTACTGC
GCCTATCCAG TGGCCAGTTT CGAGGCGGAG AACGTCACCA ATACCCTGGG CGACATCAAG
GGCTTGCGCG CGTGA
 
Protein sequence
MSGGLFIDGV WRAGAGAQAT SVDPTTGEVI WRQATASTAE VAAAVEAARK AFPAWADRSR 
EERIAVLRRY KDVLVARTGT FAEALSRETG KALWETKAEL GSMAGKVEAS IKAYDERTGE
HANDMAFGRA VLRHRAHGVM AVLGPFNFPG HLPNGHIVPA LLAGDTVVFK PSEETPLAGQ
LLVEALEEAG VPAGVINLVQ GGREVGQALI DQEIDGLLFT GSAAAGAFFR RHFADRPDVI
LALELGGNNP LVVWDAGDPE AVAALIVQSA FITTGQRCSC ARRLIVSDDA AGRAVIDAVA
ALSERLVIGP WNGGQEPFMG PLISDRAAAM ALAGAKAMPG QTLRAMTSVD GLSRAFVSPG
LVDVTGETVP DEELFAPLLQ VRRVGSFEEA IAAANATRYG LSAGLVSNET AHWDRFLTRI
RAGVVNWNRP TTGAAGTMPF GGLGNSGNHR PSAYYAADYC AYPVASFEAE NVTNTLGDIK
GLRA