Gene Caul_2403 details


Gene Information

Locus tag: Caul_2403
Symbol:
ID: 5899858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2615952
End bp: 2617886
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 69%
IMG OID: 641562894
Product: protein of unknown function DUF303 acetylesterase putative
Protein accession: YP_001684028
Protein GI: 167646365
COG category:
COG ID:
TIGRFAM ID:
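
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2615952, 2617886

length = gene_length_bp(start_bp, end_bp)
print(length)                           # 1935
print(expected_protein_length(length))  # 644
```

Under these assumptions both results agree with the Gene Length and Protein Length reported in the record.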


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.22974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.390685
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATAC GACTGGGCGC CTTGGGATTG ATCATCGGTC TTGGCGTGGC GACGGCGAGT 
TTGGCCGCGC CACGCCTCGA TGGCGTGATT TCCGACCATG CGGTGCTCCA GCGCGACAGA
CCCTTGGTCG TGACAGGCCA AGCGCGGGCC GGTGAGATGG TGACCATCGC TTTGGCCGGA
CGGAAAGCTC AGGGACGCGC CGACGGCCAA GGACGTTTTC GGTTGACGTT GCAGGCGCTG
CCCGCCGGCG GGCCACACCA GCTTCTGGTG TCCGCTCCGA GCGGTCAGCT TTTCGTCGAT
GACCTGATGA TCGGCGATGT CTTTCTCTGT TCCGGTCAGA GCAACATGGA GCTTTCCGCG
GAACAGGCCC AGAACAGCTT CCAGATCGCG GGCGCGGCCG ACGCGGGCCT GCGACTTTTG
ACGGTCGCCA AGCAAACGGC CACCGCGCCG CTGGTTGGCT TCAATACGCC GCCGGCCTGG
ACGGCGGCGA CGCCTCAAAC CATCGCGAAG TTCTCGGCGG TCTGCTTCTA CACCGGCCAG
GCGCTACGCG AGACCGTCAA GGCTCCAATC GGCCTGATCC ACGCCAGCTG GGGTGGATCG
CGGATTTCTG CCTGGATGGC GCCTCAAGGG CTTGCCGCGG CGGGCATGGA AGCCCAGGCC
CAGACCCTCG CACTCTACAA CCGCGACACC GCCGCCGCCG ACGCGCAGGC GGGCGCGGCC
TGGGAGACTT GGTGGCGTGA GCAGTCCGGT GATCGGGCTG GGCAAGAACC CTGGCGTACA
GACGGTCGGC TGACCTGGGC GGCCGCGCCC GGCGTGGGCT ATTTCAACCG CTGGGGCGTC
GCGGCGCTGT CCCGCTATGT CGGCATGGTC TGGTTCAAGA AGGAGTTCGA CCTGACGGCG
GCGCAAGCGC GGCAGAGCGC GGTGCTGTCG CTTGGCGCGA TCGACGACGC CGATAGGACC
TGGGTGAATG GCGCGCCGGT CGGCGGCGGC AGCATCGCCA GCGCCCCGCG CCAATATCGG
CTGGCGCCCG GCGCGCTGGT CGAGGGCCGC AACGTGATTA CGGTCAACGT CGACAACGTC
TATGCCGAGG GCGGCATGAC GGGACCCGCC AGCCTGATGC AGTTGCGCTT TGACGATGGC
TCGAGCCTGC CGCTCGACGC GGGCTGGCGT TACGCGATTG GCGGAAAGCC AAGATCCAAT
GCGCCGCGCT CGCCTTGGGA CGATATCAAC GGCGCCGGCA CACTCTACAA CGCCATGATC
GCGCCCCTGG GACCAACCGC GCTTCGAGGC GTGGCGTGGT ATCAGGGCGA GTCCGACACT
GACGCCCCGG GCTATGACCG TCGGCTGACA GCGATGATGG CCGACTGGCG AACCCAGTTC
GCCGCGCCCG ACCTGCCGTT CGCTGTCATC CAGCTTTCAG CCTATGGCGC GACCGCATCG
GCGCCGACCG AAAGCGGATG GGCGCGCCTG CGTGACATCC AACGTCACAC CGCTGAGGCC
GACGGGCGCG CCGCGGTCGT GGTCACGGTC GATCTGGGCG ACCGGTTCGA TATTCACCCC
GGCGAAAAGC AGGAAGTGGG GCGGCGATCT GCCCGCGCCC TGCGCGCCCT GGCCTACCGG
GAGTCCATCG CGGCGTCCGG GCCACGGATT GCGTCGGCGC TTCGTCAGGC CGACGGCGGC
GTGACACTGA CCGTCGCGGA CGCCGAGGGG GGCTTGGTTA TGCTTGGCGC CGACAGGGCC
ATCGGTTTCG AGGCCTGTGA GGCGGCGGGC GCCTGCCGCT ACGCCGACGC CAGGGCCATG
GGCGACCACG TCGTCCTCGC TGGCGTCGGC CGGTCCGTGA CGCGCGTGCG CTACGCCTGG
GCCGACAGCC CGGTCATCAA CGTCTTCGAC CGCGCCGGCC AGCCGCTTGG CCCCTTCGAG
ATCGCCGTGC CCTGA
 
Protein sequence
MTIRLGALGL IIGLGVATAS LAAPRLDGVI SDHAVLQRDR PLVVTGQARA GEMVTIALAG 
RKAQGRADGQ GRFRLTLQAL PAGGPHQLLV SAPSGQLFVD DLMIGDVFLC SGQSNMELSA
EQAQNSFQIA GAADAGLRLL TVAKQTATAP LVGFNTPPAW TAATPQTIAK FSAVCFYTGQ
ALRETVKAPI GLIHASWGGS RISAWMAPQG LAAAGMEAQA QTLALYNRDT AAADAQAGAA
WETWWREQSG DRAGQEPWRT DGRLTWAAAP GVGYFNRWGV AALSRYVGMV WFKKEFDLTA
AQARQSAVLS LGAIDDADRT WVNGAPVGGG SIASAPRQYR LAPGALVEGR NVITVNVDNV
YAEGGMTGPA SLMQLRFDDG SSLPLDAGWR YAIGGKPRSN APRSPWDDIN GAGTLYNAMI
APLGPTALRG VAWYQGESDT DAPGYDRRLT AMMADWRTQF AAPDLPFAVI QLSAYGATAS
APTESGWARL RDIQRHTAEA DGRAAVVVTV DLGDRFDIHP GEKQEVGRRS ARALRALAYR
ESIAASGPRI ASALRQADGG VTLTVADAEG GLVMLGADRA IGFEACEAAG ACRYADARAM
GDHVVLAGVG RSVTRVRYAW ADSPVINVFD RAGQPLGPFE IAVP
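
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch in plain Python, not the method used by the database. It assumes the sequence is given 5' to 3' in the coding orientation, as displayed above, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for non-start codons. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence above (60 bases, 20 codons).
first_line = (
    "ATGACGATAC GACTGGGCGC CTTGGGATTG "
    "ATCATCGGTC TTGGCGTGGC GACGGCGAGT"
)
print(translate(first_line))  # MTIRLGALGLIIGLGVATAS
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` in the same way allows the whole protein to be compared.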