Gene Caul_4562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4562 
Symbol 
ID5902023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4939897 
End bp4941063 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content69% 
IMG OID641565081 
Productsuccinyl-diaminopimelate desuccinylase 
Protein accessionYP_001686180 
Protein GI167648517 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01246] succinyl-diaminopimelate desuccinylase, proteobacterial clade 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACCA GCCCCGCTCC CGAACCTGTC AGCATCGACT CCGTCGCCCT CGCCCAGGCC 
CTGATCCGCC GTCCGTCCGT GACCCCGGCC GACGAAGGGG CGATGGACGT TCTGCAACGC
CAGCTGGAGG CCCTGGGCTT CAACTGCCGC CGCATGAAGT TCGGCGAGAT CGAGAACCTC
TACGCCCGGC GCGGGACGGA GCGCCCCAAC CTCTGCTTCG CGGGCCACAC CGACGTGGTG
CCGGTCGGCG ACAGCGCAGC TTGGACGCAG GGTCCGTTCG AGGCCGAGAT CCAGGACGGC
ATGCTCTACG GCCGCGGCGC GGTCGACATG AAGAGCGCCA TCGCCGCCTT CGTCGCCGCG
GTTTCAAATC TGCCTCGAGA TCTTCCAGGC TCGCTGAGCT TCCTGATCAC CGGCGACGAG
GAGGGCGTGG CCGAGGACGG CACCGTCCGC GTCGTCCAGG CCCTGGCTGC CGAGGGCGAG
GTCATCGACC ACTGCATCGT CGGCGAACCC ACCAGCGCCA ACCTGCTGGG CGACATGGTC
AAGATCGGCC GGCGCGGCAG CATCAACGCC TGGATCGCCG TGGACGGTCG GCAGGGCCAC
GTGGCCTATC CGCAGCGCGC CGCCAACCCG ATCCCGGTGA TGGTCGACAT CCTCTCGCGC
CTGCAAAGCC GGGTGCTGGA CGAGGGCTAT GAAGGCTTCC AGCCCTCGAA CCTGGAGGTG
ACCACGATCG ACGTCGGCAA CACCGCCACC AACGTCATCC CCGCCTCGGC CAAGGCCCGG
ATCAATATCC GCTTCAACCC CGCCCACCAG GGCAAGGACC TGCGGGCCTG GATTGAACAG
GAGTGCCGCG ACGCCGCCGA TGGCTTTTCC GGCCGGGTCG AGGCCCTGTG CAAGATCGGC
GGCGAGGCCT TTCTGACCCA GCCGGGCGCC TTCACCGACG TGATCGTGGC CGCCGTCGGC
GACGCCACCG GCCGCGTCCC GGAGCTGTCG ACCACCGGCG GCACCAGCGA CGCCCGGTTC
ATCCGCAGCC TGTGCCCGGT GGTCGAGTTC GGCCTGGTCG GCGCCACCAT GCATGCGGTC
GACGAACGCG TGCCGGTGCA GGAAATCCGG GACCTCGCCA ACATCTACCA GGCCCTGATC
GGCCGCTATT TCGCGGCCTT CGCCTGA
 
Protein sequence
MMTSPAPEPV SIDSVALAQA LIRRPSVTPA DEGAMDVLQR QLEALGFNCR RMKFGEIENL 
YARRGTERPN LCFAGHTDVV PVGDSAAWTQ GPFEAEIQDG MLYGRGAVDM KSAIAAFVAA
VSNLPRDLPG SLSFLITGDE EGVAEDGTVR VVQALAAEGE VIDHCIVGEP TSANLLGDMV
KIGRRGSINA WIAVDGRQGH VAYPQRAANP IPVMVDILSR LQSRVLDEGY EGFQPSNLEV
TTIDVGNTAT NVIPASAKAR INIRFNPAHQ GKDLRAWIEQ ECRDAADGFS GRVEALCKIG
GEAFLTQPGA FTDVIVAAVG DATGRVPELS TTGGTSDARF IRSLCPVVEF GLVGATMHAV
DERVPVQEIR DLANIYQALI GRYFAAFA