Gene Caul_0989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0989 
Symbol 
ID5898444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1044832 
End bp1046262 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content68% 
IMG OID641561471 
Producthypothetical protein 
Protein accessionYP_001682617 
Protein GI167644954 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.306467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGGCA AGACCAAGAC CTTCCTTCTG ACGGCGGCGG CGATGTCCGC CCTGGCCATG 
ACCTCGGCCG CCCGCGCTCA GTCAGCGCCT CAGGACTCCG GCGAAACCGC CTTCCGCGGG
CTCTACAAGG AACTGGTCGA GATCAACACC ACCGCTTCGG TCGGCAGTTG CACGGCCGCC
GCCGAGGCGA TGGGCGCGCG GCTGAAGGCG GCGGGCTTGC CCGACAGCGA CGTGCGCGTG
CTGGCCCCGT CCGACTATCC CAAGTTCGGC GCCCTGGTCG CCACCCTGCA CGGCAAGGAC
CCCAAGGCCG GCGCGATCCT GCTGCTGGCC CACATCGACG TGGTCGAGGC CAAGCGCGAG
GACTGGGTGC GCGATCCGTT CAAGCTGGTC GAGGAGGATG GCTATCTCTA CGGGCGCGGG
ACCAGCGACG ACAAGGCCAT GGCCGCGATC TTCACCGACA GCCTGGTTCG CTACAAGGCC
GAGGGTTTCA AGCCCAGGCG CGACATCAAG CTGGCCCTGA CCTGCGGCGA GGAGGGCGGG
CCGTTCAACA GCGTGCCCTG GCTGCTGGAG AAATATCCCG AGACCCTGAA GGCCGATTTC
GCGCTCAACG AAGGCGATGA CAGCCGGCTG GACGACAAGG GCCAGCCCAA GCTGCTGTCC
ATCCAGGCCG GCGAGAAGGT CTATATGGAC TACAATCTCG AGGTCACGAA CCCGGGCGGC
CACTCGTCGC GGCCGATCCG GGACAACGCC ATCTATCACC TGGCGGGCGG CCTTTCGCGC
CTGGCCGCTT ATGACTTCCC GATCGCCCTG AACGACGCCA CCAAGGGCTA TTTCGAACAG
AGCGCCAAGA TCGAGCCCGA CGCCGAGGTG GCCGGTGCCA TGCGCGCCAT GGTCAAGGAT
CCGACCGACG ACGCGGCCGC CGCCATCCTG GCCCGCGATC CGACCCGCAA CAGCATGATG
CGCACCACCT GCGTGGCCAC CATGGCCGAG GCGGGCCACG CGCCCAACGC CCTGCCGCAG
CGCGCCAAGG CCAACGTCAA TTGCCGCATC CTGCCCGGCA ACGATCCCAA GGTCGTCCGC
GACCAGTTGG AGACGATCGT CGCCGACCCG GCCATCAAGA TCACCCTGGC CGCCGACCCC
GATCCGGTCA GCCCGCCGCC GCCGCTGTCC TCGCGGATCA TGGGTCCGGC GACCAAGGTG
GCCGGCAAGA TCTGGCCGGG CCTGCCCTTC ATCCCGCTGA TGTCGACCGG CGCCACCGAC
GGCCGCTTCA CCAACGCCGC CGGCACGGTG ACCTACGGCC TGTCGGGCCT GATGGCCGGT
CCGGACGGCG ACAACATCCA CGGCCTGAAC GAGCGCATCC AGGTCAAGGC GCTGATGAAC
GGGCGGCGGT TCCTCTACGA GGTGGTCAAG CTGTACGCCG ACGGGAAGTA G
 
Protein sequence
MRGKTKTFLL TAAAMSALAM TSAARAQSAP QDSGETAFRG LYKELVEINT TASVGSCTAA 
AEAMGARLKA AGLPDSDVRV LAPSDYPKFG ALVATLHGKD PKAGAILLLA HIDVVEAKRE
DWVRDPFKLV EEDGYLYGRG TSDDKAMAAI FTDSLVRYKA EGFKPRRDIK LALTCGEEGG
PFNSVPWLLE KYPETLKADF ALNEGDDSRL DDKGQPKLLS IQAGEKVYMD YNLEVTNPGG
HSSRPIRDNA IYHLAGGLSR LAAYDFPIAL NDATKGYFEQ SAKIEPDAEV AGAMRAMVKD
PTDDAAAAIL ARDPTRNSMM RTTCVATMAE AGHAPNALPQ RAKANVNCRI LPGNDPKVVR
DQLETIVADP AIKITLAADP DPVSPPPPLS SRIMGPATKV AGKIWPGLPF IPLMSTGATD
GRFTNAAGTV TYGLSGLMAG PDGDNIHGLN ERIQVKALMN GRRFLYEVVK LYADGK