Gene Caul_2542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2542 
Symbol 
ID5899997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2756596 
End bp2757813 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content68% 
IMG OID641563033 
Productmembrane dipeptidase 
Protein accessionYP_001684167 
Protein GI167646504 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.51564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGTT CCCTGCTTCT GGCCGCCGTC TCGGTCTTGG CCTTCGCCAC CTCGTCCCAG 
GCCGCCGACA CCGCCGCCTC GGTCCGCAAG ATCCACGAGG GCCTGCTGAC CCTCGACACC
CATCTGGACA CTCCAGCCAA TTTCGGCCGT CCGGGCTGGG ATATCCTGGA CCGGCATGAC
GCCGCCAAGG ACGGCTCGCA GATCGATTAT CCCCGCATGG TCGAGGGTGG GCTGGATGGC
GGCTTCTTCG CCATCTACAC GCCGCAGGGG CCGCGCACGC CCGAGGCCAC CCGCGCCGCC
CGGGACGGCG CCTTGGTCCG CGGCGTCGAG ATCCGCGAGA TGGTGGCCAA GCACGGCGAC
AAGTTCGCCC TGGCCCTGAA GGCCGACGAC GCCGCCAAGA TCGCCGCCAG CGGCAAGCGC
GTCGTCTTCA TGAGCATCGA GAACAGCTAC CCGATCGACG GCGACGTCAC CCTGCTGTCC
AGCTTCTACG CCCTGGGCGT GCGGATCAGC GGCCTGGCCC ACTTCAAGAA CAACGACATG
GCCGACAGCT CGACCGACAA GCCCGAGTGG CATGGCCTCA GCCCGCTGGG CAAACAGTTC
GTCACCGAGG CCAACCGGCT GGGCGTGGTG CTGGACGGCT CGCATTCGTC CGACGACGTG
CTCGATCAAC TGATCGCCCT GTCCAAGACC CCGGTGATCC TGACCCATTC AGGCTGCAAG
GCGGTGTTCG ACCATCCGCG CAATGTCGAC GACGCCCGCA TCAAGGCCCT GGCCGACAGC
GGCGGGGTGA TCCAGGTCGA CGCCTATTCC AGCTATCTGA TCGACACGCC CAAGAACCCC
GATCGCGAGG CCGCCATGGC CGCCCTGATG GCCAAGGTCG GGGCGCGGGC TAAGATGACC
GAGGAGCAGC GCGCCGCCTT CATAGCCGAA CGCAACGCCA TCGACGCCAA GTGGCCGGTG
ACCAAGGCGA CGTTCCAGGA CTTCATGAAC CACCTCAACC ACGCCCTGAA GGTGGCCGGC
GTCGATCACG TGGGCGTCGG CATCGACTTC GACGGCGGCG GCGGCGTCAC CGGCCTGAAC
GACGCCTCCG ACTACTGGAA GATCTCCCAG GCCCTGCTGG CCGAGGGCTA CACCCAGGCC
GACCTGGAGA AGATCTGGAG CGGCAACGTC CTGCGTCTGC TGCGCGCCGC CGAGGCGGCC
AAGGCGCCGG CGGGGTGA
 
Protein sequence
MTRSLLLAAV SVLAFATSSQ AADTAASVRK IHEGLLTLDT HLDTPANFGR PGWDILDRHD 
AAKDGSQIDY PRMVEGGLDG GFFAIYTPQG PRTPEATRAA RDGALVRGVE IREMVAKHGD
KFALALKADD AAKIAASGKR VVFMSIENSY PIDGDVTLLS SFYALGVRIS GLAHFKNNDM
ADSSTDKPEW HGLSPLGKQF VTEANRLGVV LDGSHSSDDV LDQLIALSKT PVILTHSGCK
AVFDHPRNVD DARIKALADS GGVIQVDAYS SYLIDTPKNP DREAAMAALM AKVGARAKMT
EEQRAAFIAE RNAIDAKWPV TKATFQDFMN HLNHALKVAG VDHVGVGIDF DGGGGVTGLN
DASDYWKISQ ALLAEGYTQA DLEKIWSGNV LRLLRAAEAA KAPAG