Gene Caul_0655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0655 
Symbol 
ID5898110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp722576 
End bp724228 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content68% 
IMG OID641561137 
Productpeptidase M28 
Protein accessionYP_001682286 
Protein GI167644623 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.215111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.412829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCC TGCTGCTCGC GACCGCCCTG TTCGCCGCTC CCATGCTGGC TCACGCGGCG 
GACGCGCCCA AGATCGACCC CGCCAGGCTG TCGGCCCACA TCAAGGTGCT GTCGTCCGAC
GACTTCGAGG GTCGCGGCCC CGCCACGGCC GGCGAGACCA AGTCGGTCGA CTACATCGTG
GGCCAGATGA AGGCGATCGG GCTGGAGCCT GCCGGCGACC TGAAGAACGG AACCCGCGCC
TGGACCCAGG ACGTGCCGCT GGGCAAGTTC GACATCAAGG GTCCGGTCTC CGCTCAATTC
ACGATCGGCG GCAAGGTCGT TCCCCTGGCC CAGGGCGAGC AGATCGCCAT CCGCGCGGCC
ATGACCAATG TCGACGCCGT CGCCATCAAG GACGCGCCGC TGGTCTTCGT CGGCTACGGC
GTCAAGGCGC CGGAACGAAG CTGGGACGAC TTCAAGGGCC TGGACCTGAA GGGCAAGGTC
CTGGTGGTGC TGATCAACGA CCCCGACTTC GAGTCCGGCG CCGGCGACTT CGGCGGCAAG
GCCATGACCT ATTACGGTCG CTGGACCTAC AAGTACGAGG AAGCCGCCCG CCAGGGCGCG
GCGGGCGTGC TGATCGTCCA CGAGACCGCC CCGGCCTCCT ATGGCTGGGC GACGGTCAAG
AACTCCAACA CCGCCACGAT GTTCGACATC GTCCGCGCGG CGCCCGCCAA GGTTCACCCG
AACCTGGAGG CCTGGATCCA GCGCGACGTC GCGGTCGACC TGTTCAAGGC CTCGGGCCTG
GACTTCGACC TCCTGAAAAA GCAGGCCCAG GGCCGCGACT TCAAGCCGGT GGACCTGAAG
GGCGCGACCT TCTCGGCCAG CTACGCCGTC GATCCTTCGG TGATCGTGTC CAAGAACATC
GCCGGCCGGA TCAAGGGCTC GGCCCATCCC GACGAGACGG TGATCTACAG CGCCCACTGG
GACCACCTGG GCGTCGGCCA GCCCGACGCG CGCGGCGACA AGATCTATAA CGGCGCCATC
GACAACGCCG ACGGGATCGC CGCCATCCTG GAGCTGGCCC GCGCCTTCAA GAGCCAGCCG
GCCCCGCAGC GCTCGATCCT GTTCCTGGCC GTCACCGCCG AGGAACGCGG CCTGCTGGGC
TCGGAATACT ATGCGGCCAA CCCGCTCTAT CCGCTGTCCA AGACGGTCGG CGACCTGAAC
ATCGACGCCC TGTCGGCCAC CGGCCCGGCC AAGGACATCA CCACCTCGGG CGACGGCAAG
GTCGATCTGC AGGACCTGCT GGTCGCCAAG GCCAAGGCTC ATGGCCGCTA CTTCACGCCC
GACCCGTCGC CGCAAGCCGG CCACTTCTAT CGCTCGGACC ACTTCCCGTT CGCCAAGCGC
GGCGTGCCGG CCATCTCGGT CGGCTCGGGC GAGGACCTGG TGGTCGGCGG CAAGGAGGCC
GGCGAGAAGG CCGAGGCCGA CTACACCGCC AACCGCTACC ACCAGCCCGC CGACGAATGG
AAGGCCGACT GGGACCTGAC GGGCCAGGCC CAGGATATCG GCCTGTTCTA CGAGATCGGT
TCCGACCTCG CCAATTCGAA GACCTGGCCA GAGTGGCAGG CGGGATCGGA GTTCAAGGCG
CTGCGGGATC AGACCAAGAG CGACCGGAAA TAG
 
Protein sequence
MKRLLLATAL FAAPMLAHAA DAPKIDPARL SAHIKVLSSD DFEGRGPATA GETKSVDYIV 
GQMKAIGLEP AGDLKNGTRA WTQDVPLGKF DIKGPVSAQF TIGGKVVPLA QGEQIAIRAA
MTNVDAVAIK DAPLVFVGYG VKAPERSWDD FKGLDLKGKV LVVLINDPDF ESGAGDFGGK
AMTYYGRWTY KYEEAARQGA AGVLIVHETA PASYGWATVK NSNTATMFDI VRAAPAKVHP
NLEAWIQRDV AVDLFKASGL DFDLLKKQAQ GRDFKPVDLK GATFSASYAV DPSVIVSKNI
AGRIKGSAHP DETVIYSAHW DHLGVGQPDA RGDKIYNGAI DNADGIAAIL ELARAFKSQP
APQRSILFLA VTAEERGLLG SEYYAANPLY PLSKTVGDLN IDALSATGPA KDITTSGDGK
VDLQDLLVAK AKAHGRYFTP DPSPQAGHFY RSDHFPFAKR GVPAISVGSG EDLVVGGKEA
GEKAEADYTA NRYHQPADEW KADWDLTGQA QDIGLFYEIG SDLANSKTWP EWQAGSEFKA
LRDQTKSDRK