Gene Caul_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1831 
Symbol 
ID5899286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1938035 
End bp1939291 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content69% 
IMG OID641562321 
ProductGDSL family lipase 
Protein accessionYP_001683458 
Protein GI167645795 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0423469 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTCA GCAAGCGCGC CCTGCCGGGG CTGGCCATCG CCCTGGCCTT GTCGATCGCC 
TCGATCTCGG CCGCCGCGCC AGCCAAGGCG CCGCAACAGC AGCGTTGGGT GGGCTCGTGG
GCGTCTGCCC AGATGATTCC CGCCTCGAAG GACGCCCTTG CCCCGGCCGA CCTAGCCGAC
GGCACCCTGC GCCAGACGCT GCGCCTGAGC ACCGGCGGCG GCAAGATCCG CGTGCGGCTG
TCTAACGCCT TCGGAACCGA CCCGTTGAAG CTGACGGGCG TGCACGTCGC CCTCGCGGCC
GCGCCCGGCT CGGCCCGCAT CGATCCGGCG ACGGATCGCG CCCTCACCTT CTCCGCCCGC
CCGGACGTGA CCATTCCACC TGGCGCCGAA TATCTGTCCG ATCCGCTCGA CTTCCCGGCC
GCCCCGCTGG CGAACCTGAC TGTCAGCATG CGCTTCGTCG GTCTTCCTGC CCAGCAGACC
AGCCATCCGG GGTCGCGCAC CACCTCGTGG ATCGCCGCCG GCGACCAACT TGGCGCCGCC
GACCTGCCCG GCGCCAAGTC GATCGACCGC TGGTACCAGT TGTCGGGCGT CGACGTGCTG
CGGGCTGGCG GTTCCAGCCT GGTGACCTTC GGCGACTCGA TCACTGACGG TTATGGCGTC
ACCCCCAACG GCAACAATCG CTGGCCCGAC ATCCTGGCCG CCCGGTTGCA GGCAGACCGG
CGCACCGCCG GCGTGGGCGT GCTGAACCTT GGCATCGGCG GCAACCGGCT GCTGCTCGAC
GGCCTGGGTC CCAACGCCAT GACCCGCTTC GATCGCGACG TGCTGGTCCA GGCCGGCGTC
AAGCACCTGA TCGTGCTGGA GGGCGTCAAC GACCTGGGGG TCCTGACCCG CGACCAGCCG
GTGGATCCGC AGGTCCATGC GCAACTGGTG GCCAATGTCC TCGCGTCCTA CGCCCAGATG
ATCCAGCGGG CCCGCGAGCA CGGGATCAAG ATCCATGGCG CCACCATCAT GCCCTATGGC
GGTTCGGCCT ATTACCACCC CGCAGCCGTC AACGAGCAGG ACCGGCAGGC GATCAACGCC
TGGATCCGCG CGCCCGGCCA CTTCGATTCG GTGATCGACT TCGACAAGCT GATGCGCGAT
CCGGCCGATC CCAGCCGCCT GTCGCCCGCC TACGATTCCG GCGACGGCCT GCATCCGTCG
CTGGCCGGCT ACAAGGCCAT GGCCGACGCC ATTCCGCTCA AGCTGTTCGC GCCCTGA
 
Protein sequence
MDFSKRALPG LAIALALSIA SISAAAPAKA PQQQRWVGSW ASAQMIPASK DALAPADLAD 
GTLRQTLRLS TGGGKIRVRL SNAFGTDPLK LTGVHVALAA APGSARIDPA TDRALTFSAR
PDVTIPPGAE YLSDPLDFPA APLANLTVSM RFVGLPAQQT SHPGSRTTSW IAAGDQLGAA
DLPGAKSIDR WYQLSGVDVL RAGGSSLVTF GDSITDGYGV TPNGNNRWPD ILAARLQADR
RTAGVGVLNL GIGGNRLLLD GLGPNAMTRF DRDVLVQAGV KHLIVLEGVN DLGVLTRDQP
VDPQVHAQLV ANVLASYAQM IQRAREHGIK IHGATIMPYG GSAYYHPAAV NEQDRQAINA
WIRAPGHFDS VIDFDKLMRD PADPSRLSPA YDSGDGLHPS LAGYKAMADA IPLKLFAP