Gene Caul_1698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1698 
Symbol 
ID5899153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1786192 
End bp1787835 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content65% 
IMG OID641562188 
Producthypothetical protein 
Protein accessionYP_001683325 
Protein GI167645662 
COG category 
COG ID 
TIGRFAM ID[TIGR02474] pectate lyase, PelA/Pel-15E family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.498115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0888224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTTA GCCCCCTCAC CCGCCGCCGC CTGCTGGCGA CGACCACCGC CACCTTCGCC 
GTCGCGGCGG CCATGGCCCC GCTCAAGGCC TTGGCGCAAG GGATCGCGCC CGCCAAGGAC
CAGGTCCTGG CGACCATGAA GAAGGCCACC ACCTTCATGG CCGAGAAGGC CGCCTATGAG
GGCGGCTATG TCTGGAGCTA CCTGCCCGAC TTCTCGCGCC GTTTCGGCGA GATGGAAGCC
TTCCCGACCA TGATCTGGGT CCAGCCGCCG GGCACGGCGA CCATGGGCCA TCTGTTCCTG
GACGCCTATC ACGCCACCGG CGACGAGTAT TACTACGAGG CGGCCAAGAA AGCGGCCGGC
GCCTTGATCA AGATTCAGCA CCCGGCGGGT GGCTGGAATT ACATGGGCGA CCTCGCCGGA
CCGGAGTCGC TGAAGAAATG GTACGACACC ATCGGCAAGA ACGGCTGGCG GCTCGAGGAA
TTCCAGCATT ACTACGGCAA CGCCACCTTC GACGACGAGG GCACCGCCGA GAGCTGCCAG
CTGATGCTGC GCGTCTATCT GGAAAAGAAG GACAAGGCCT TCAAGCCAGC CCTGGACAAG
GCCATCCAGT TCGTCCTCGA CGCCCAATAT CCGAACGGCG GCTGGCCCCA GCGCTTCCCG
TTGTCGGGCG GTTTCGAGAA CAAGGGCCAC CCCGACTATA CCGGCTACAT CACCTTCAAT
GACGGCGTCG TCGACGAGAA CATCAAGTTC CTGCTGATGG TCTGGCAGAC CTTGGGCGAC
AAGCGCGTGC TCGACCCGAT CCAGCGCGCC ATGGACATCT ATGTCGCCGC CCAGCAACCG
ATGCCACAGC CCGGCTGGGG CCTGCAGCAC ACCGTGGCCG ACCTGAAGCC GGCCGGCGCC
CGCACCTACG AGCCCAAGGC CTTCGCCAGC CACACCACGG CCGGCAACCT CGAAAACCTG
ATGGACTACT ATCAGCTGAC GGGCGATCCG AAGTACCTGG CCCGCGTGCC CGAGGCCATC
GACTGGCTGG CCAACCTGAA GCTCGACGAT AAGATCGCCC CCGGCAAGCC GCGCTATCCC
ACCTTCATCG AGATCGGCAC CAACGAGCCG CTCTACGTGC ACCGTCGCGG CTCCAACGTG
TTCAACGGCG AGTACTTCGT CGACAAGCAC TGGGAAAACA CCATCGTCCA CTACTCATCG
TTCCGCTCGG TCAACATCGA CAAGCTGCGC CAGCGCTACG CGGCGCTGAA GGCGACGCCG
CCAGAGGTCG CCAGCAAGGA TTCGCCGCTG AAGGTGAAGG GCGGCAGCCG GCCGCTGCCC
CGCTACTTCA CCACCAAGGA CCTCTCGGTG TCCGACCTGA ACGTCGGCGC CCTGAAGACC
AGCGACAAGA CCACCGACGC CGACGCCGCC AGGCTGATGG CGCAACTGAA CGCCGAGGGC
TGGTGGCCCA CCGAGATGAA GGCGGTCAGC AACCCCTATA TCGGCGATGG CTCGATGACC
GTCACCCCGG GCGAGTTCTC CCAGACCCGC GTCGGCGACC CCACCGACAC CTCGCCCTAC
ATCGCCGACA CGCCCAAGCC GGGCATCTCG ACGGGCGCCT ATATCGAGAA CATGGCGGCG
CTGATCCGCT ACGTGACGGC GTAA
 
Protein sequence
MPLSPLTRRR LLATTTATFA VAAAMAPLKA LAQGIAPAKD QVLATMKKAT TFMAEKAAYE 
GGYVWSYLPD FSRRFGEMEA FPTMIWVQPP GTATMGHLFL DAYHATGDEY YYEAAKKAAG
ALIKIQHPAG GWNYMGDLAG PESLKKWYDT IGKNGWRLEE FQHYYGNATF DDEGTAESCQ
LMLRVYLEKK DKAFKPALDK AIQFVLDAQY PNGGWPQRFP LSGGFENKGH PDYTGYITFN
DGVVDENIKF LLMVWQTLGD KRVLDPIQRA MDIYVAAQQP MPQPGWGLQH TVADLKPAGA
RTYEPKAFAS HTTAGNLENL MDYYQLTGDP KYLARVPEAI DWLANLKLDD KIAPGKPRYP
TFIEIGTNEP LYVHRRGSNV FNGEYFVDKH WENTIVHYSS FRSVNIDKLR QRYAALKATP
PEVASKDSPL KVKGGSRPLP RYFTTKDLSV SDLNVGALKT SDKTTDADAA RLMAQLNAEG
WWPTEMKAVS NPYIGDGSMT VTPGEFSQTR VGDPTDTSPY IADTPKPGIS TGAYIENMAA
LIRYVTA