Gene Caul_1193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1193 
Symbol 
ID5898648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1253878 
End bp1255806 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content66% 
IMG OID641561676 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001682821 
Protein GI167645158 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAGT TTTTCGCCGG CGCCGCGATG GCCGCGCTGA TCGGGTCGAC CACCGCCGCG 
GCCGCGCCGA TCGAAGCGTA TGGTCGCCTA CCAGCCATGA GCGACGTGTC GATCTCGGCC
AACGGCGCCA ATCTCGTCTA CATCCTGAAC GAGGGCGGCA CGGCCACCGT CGTCGCCCAG
GGCCTGGACG GCACGGTGCT GCAATCGGCC AATCTGGGCG CGCGCAAGGT GCGGGGGGTG
GCCTGGATCG ACGACACCCA CGCGATGATC GAGATCTCGT CGACGGCCGG CATCGAGGGC
GTGGCCTATG TCCACGAATG GTTTCAGGCC GTCAGCCTCA ACATCAAGAC CGGCGCGGTC
GTGCGGGTGC CCGACAGCGC TGCGTCGAAA GACCTGCTGA ACGTCATGAT GAGCACGCCC
CAGGGCGGAA CCTATGGCGG CAAGTCGGTG ATCTGGGCCA GCCTGTACGC CAAGGACGGG
ACCAGCATCC AGGACGACGG ACACATGGAT TTCTACCGTG TGGACCCCGA CACCGGCGTG
GGCCGCGTCC AGCAACCCGG CGACGGCGAG ACCCAGGAGT TCCTCGCCAA GCCGGACGGC
ACGGTGCTGG CGCGCGTCAA GTACAAGCCC AAGGATGGTC ACTGGCGGCT CGAGCTGCGC
CATTCCGGCT GGGACGAGGC CTATTCGGTG TTCGCTCCGA TCGATACGCC GGACCTGATC
GGCCGGTCGC TGGACGACAA GTCCCTGATC CTGCGCCTGT GGGATGACAA GGAAGAGATG
TGGCGGCTGG CGCCCGTCTC CCTGGCCGAC GGCAAGATCG GCGACTATTT CGGCCCGGAC
AGGCCGTTCG GCGTCGTCAC CGACGACGAA CAACGTTTGA TCGGCCTGTC CTCCACCGAC
GTCTACACGG AATACGAGTT CTTCGAGCCC CGCCTGAAGG CGGTCTGGCC GCAGGTGCGC
CAGGTGTTCG CCGGACGCCA GGTGACCCTG ACCTCCAACA CGCCCGACTA CGCCAAGCTG
ATCGTCTATG TCGAAGGCAC CGGCGAGCCC GGCGGCTACT ATCTGGTCGA TCTGGGAGCC
AAGAAGGTCA AGCGGATCGG CGCGGCCTAT CCCGCCCTGA CCGGCGGCGA CATCGCCCAG
GTACAGGCGA TCAAGTACAA GGCCGCCGAC GGCCTGGAGA TTAACGCCTA CCTTACCCTG
CCCAACGGCA GGCCCGCCAA GAGCCTACCC CTGATCGTTT TCCCGCACGG CGGGCCGCAG
TCGCGCGATG GCGCGGGTTT CGACTGGTGG GCCCAGGCCA TGGCTTCGCG CGGCTATGCC
GTGCTGCAGC CCAACTTCCG CGGCTCGTCC GGCTATGGCC GCAAGTTCGT GGAGGCCGCA
TACGGTGAGT GGGGCGGCAA GATGCAGACC GACCTGTCCG ACGGCGTCCG CGCCCTGGCC
AAGGCAGGCA CGATCGATCC CAAGCGCGTC TGCATCGTCG GGGGCAGCTA CGGCGGCTAC
GCGGCCCTGG CTGGCATTAC CCTGGACAAG GGCGTCTACC GCTGCGCCGT GGCCGTGGCC
GGCGTGTCCG ACATGGGTAA GATGCTCGAC CGCGAAACGG CTCGGTCGGG CGCCGACAGC
AGCACCGTCC GTTACTGGAA ACGCTACATG GGCGTGGAGA AGTCGTCGGA CGCCTTGCTT
AACCAGCGCT CGCCGGTGAA CTTTGCCAAC AACGCCGACG GCCCTGTCCT CCTGATCCAT
GGCAAGGACG ACACCGTGGT CAACTATGAT CAGAGCGCAG CCATGCGCCA TGCCCTCGAA
AAGGCCGGGA AGCCGGTGGA ACTGGTCACG CTGAAGGCCG AAGACCACTG GCTTTCACGT
GAAGGCACCC GCCAACAGAT GCTGTCGGAG ACCGTCACCT TCCTGGAAAA GAACAACCCG
CCGAACTAG
 
Protein sequence
MLKFFAGAAM AALIGSTTAA AAPIEAYGRL PAMSDVSISA NGANLVYILN EGGTATVVAQ 
GLDGTVLQSA NLGARKVRGV AWIDDTHAMI EISSTAGIEG VAYVHEWFQA VSLNIKTGAV
VRVPDSAASK DLLNVMMSTP QGGTYGGKSV IWASLYAKDG TSIQDDGHMD FYRVDPDTGV
GRVQQPGDGE TQEFLAKPDG TVLARVKYKP KDGHWRLELR HSGWDEAYSV FAPIDTPDLI
GRSLDDKSLI LRLWDDKEEM WRLAPVSLAD GKIGDYFGPD RPFGVVTDDE QRLIGLSSTD
VYTEYEFFEP RLKAVWPQVR QVFAGRQVTL TSNTPDYAKL IVYVEGTGEP GGYYLVDLGA
KKVKRIGAAY PALTGGDIAQ VQAIKYKAAD GLEINAYLTL PNGRPAKSLP LIVFPHGGPQ
SRDGAGFDWW AQAMASRGYA VLQPNFRGSS GYGRKFVEAA YGEWGGKMQT DLSDGVRALA
KAGTIDPKRV CIVGGSYGGY AALAGITLDK GVYRCAVAVA GVSDMGKMLD RETARSGADS
STVRYWKRYM GVEKSSDALL NQRSPVNFAN NADGPVLLIH GKDDTVVNYD QSAAMRHALE
KAGKPVELVT LKAEDHWLSR EGTRQQMLSE TVTFLEKNNP PN