Gene Caul_3875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3875 
Symbol 
ID5901337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4194930 
End bp4196279 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content68% 
IMG OID641564397 
Productmembrane dipeptidase 
Protein accessionYP_001685499 
Protein GI167647836 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGGT CGTTCTGCAG CGCCGGCGCG GCCCGAGGGC CGGAGGGCGC TTGCGCCCTT 
GACCGCACCC CTCCGATCCC CGAGCGTGGC CGCAACGCCA TTCGCTTCGG AGCCAAGATG
TCCCGCCTGC TGACCGCCCT GCTCGGCTCC GCCGCCCTGT TCGCCACCGC CGCACACGCC
ACCGACACGC CCAGGCCGGG CGAGGTCTCC AAGCAGGATA GGGTTCTGCA CGAAAAGGTC
CTCACGCTCG ACACGCACCT GGACACCCCC GAGCACTTCG CCCGCCGGGG TTGGAGCATG
ATGGACCGGC ATGTCGTCAC CGAGGACGGC ACCCAGGTCG ACCTGCCGCG AATGAACGCC
GGCGGCCTAG ATGGGGGGTT CTTCGTGATC TATACCACCC AAGGGCCTCT GACCGCCGAG
GGCTATCGCG GCGCGCGTGA CTTCGCGCTC GAACGCGCCA CCGAGATCCG CGAGATGGTC
GCCGCCCATC CCGACAAGTT CGAGCTGGCC TACACCGCCG ACGACGCCGA GCGGATCAAC
AAGGCCGGCA AGAAGTTCGT CTTCCAGAGC ATCGAGAACA GCTGGCCGAT GGGTGAGGAT
CTCACCCTGA TGCGGACCTT CTACGCCACC GGCGTGCGGA TGGCCGGACC GGTCCACTTC
CGCAACAACC AGTTCGCCGA CAGCTCGACC GACAAGCCGA TCTGGCACGG CTTCTCGCCG
CTGGGCCTGC GCTGGCTGGC CGAGGCCAAC CGGCTGGGGA TCCTGATCGA CGTCAGCCAC
GCCTCCGACG ATGTGGTCGA CCAGGCCGTG GTGCTGTCCA AGGTCCCGAT CATCGCCTCG
CACTCCGGCG CCAAGGCGGT CTATGACGCC GCCCGCAATC TCGACGACGG GCGGCTGAAG
AAGATCGCCG ACGCGGGCGG GGTGATCTGC ATCAACTCGG TCTATCTGAA GGCCACGCCC
ACCAGCCCAG AGCGCAAGGC CGCGTTCGAG GCTCTGGGCA AGGCCCCCGA CAGCGAGACG
GCGAGCGAAG CCGAGATCGT CGCCTTCATG AAGAAGAAGG TCGAGATCGA CGCCAAGTTC
CCGCCGGTCC GCGCCTCGTT CGAGGACTTC ATGGCCAGCC TGACCCACAC CCTCAAGCTG
GTCGGCCCCG AGCACGTCGG CATCGGCGCC GACTGGGACG GCGGCGGCGG CGTGATCGAC
TTCGAGGACG TCGCCGACCT GCCCAAGGTC ACCGCCCGGC TGAAGGCGGC CGGCTACACC
GACGCCGACG TGGCGGCGAT CTGGGGCGGC AACGTGCTGC GCGTGGTGAA GCAGGCGCAG
GACTACGCGA AGGCGGCAGC GGCCAAGTAA
 
Protein sequence
MPGSFCSAGA ARGPEGACAL DRTPPIPERG RNAIRFGAKM SRLLTALLGS AALFATAAHA 
TDTPRPGEVS KQDRVLHEKV LTLDTHLDTP EHFARRGWSM MDRHVVTEDG TQVDLPRMNA
GGLDGGFFVI YTTQGPLTAE GYRGARDFAL ERATEIREMV AAHPDKFELA YTADDAERIN
KAGKKFVFQS IENSWPMGED LTLMRTFYAT GVRMAGPVHF RNNQFADSST DKPIWHGFSP
LGLRWLAEAN RLGILIDVSH ASDDVVDQAV VLSKVPIIAS HSGAKAVYDA ARNLDDGRLK
KIADAGGVIC INSVYLKATP TSPERKAAFE ALGKAPDSET ASEAEIVAFM KKKVEIDAKF
PPVRASFEDF MASLTHTLKL VGPEHVGIGA DWDGGGGVID FEDVADLPKV TARLKAAGYT
DADVAAIWGG NVLRVVKQAQ DYAKAAAAK