Gene Caul_2528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2528 
Symbol 
ID5899983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2741660 
End bp2744110 
Gene Length2451 bp 
Protein Length816 aa 
Translation table11 
GC content65% 
IMG OID641563019 
Productorganic solvent tolerance protein 
Protein accessionYP_001684153 
Protein GI167646490 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.150977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAGT TTCGCTGGGG GGCGAAGGAC CCAAAGGCCG TCGGTATGGG CCGCGCGGTC 
CTGCTGGCCG GTGCGGCCTG GCTCGCCCTC GCCTGCTCGG CGCAGGCGCA ACAGCCGCTC
GCCACCGTTC CGGCCGCTCC GACCCCGTCC CCGGCGGTCG ACGATGGCCT GGGCGACACG
GGCTACTATC TCGAATCCGA CCTGCTGATT CGCGACGACG CCAATCAGAA GATGATCGCC
CGCGGCGAGG TCGAAGCCCG CTACCAGGGC CGCACCCTGC GGGCCGACGA GGTGGTCTAC
GACAGCAAGA CCGAGGTGGT CACCGCCCAC GGGCACGTGC AACTGATCAA CGCCGACGGC
ACCGCCCAGT TCGCCGACGA AATGACCATG GACAAGGACA TGAAGGCGGG CTTCGCGCGC
GGCTTCTCGG CCCGCCTGGA CAAGAACATC AAGATCGCGG CGGACACCGC CGTGCGTCGC
AACGAGCAGA TCACCGAGCT GAACCAGGCG ATCTACACAC CCTGCGAGGT TTGCGCCGAA
AAGCCCAAGC CGACCTGGAG CATCCAGGCC GACAAGGTCG TGCAGGACAA GAACCGCCAC
CTTGTCTACT ACCACGGCGC GACGATCCGC ATGTGGGGCG CGCCGCTGCT GTACCTGCCG
GTGTTCTGGC ATCCCGACCC GCAGACCCAG CGCAGTTCGG GCTTCCTGAC GCCGAAGCTG
GGCGTCTCCA AACGGCGCGG CGTCTCCTAT CAGCAGCCCT ATCTGTTCGT GTTCTCGCCG
TCTCAGGATT TGGTGCTAAC CCCTCAAATC AATGCGAAAG TTAACCCGTT CCTGAACGCG
CAGTACCGCA AGCGCTTCTA CTCCGGCGCC GTCGATGTCC GGGCCGGCGG AACCTACGAC
AAGGACTTCG ACAACCACGG CGACCGGTTC GGCAAAGGCA TGTTCAAGAG CTACATCCTC
GCCCGCGGTC TGTTCGATAT CGACCAGAAG TGGAAGTGGG GCTTCACCGC CGAGCGGGCT
TCGCAGGCGC TGATCTTCGA CGACTACGAC ATCAGCGACG TCTACCAGCA GCGCGGCCAG
TTCACGGCCG ACGACCACCG GCTGATGTCG CAGATCTACA CCACGCGGCA GGACAAGCGG
TCCTACTTCT CGGCTTCGAT GATTTCGGTG CAGGGTTTGC GGGTTGTGCA GGTGGATCCG
GGCACGGGCC TGGCCAACCG GTTCGAGAAC AGCGGCGCCT TCCCCCTGAT CGGCCCCCTG
GTCGAGGGGC GCTGGGAGCC GGAATCGCAC ATTCTGGGTG GCCGGCTTCG CGTCCAGGGC
TCCGGCGTGG TGCTGACCCG CTCGGAATCC CAGTTTGGCG AGCCGCCCTA CGCCTATGCC
GACTACAAGG GCAAGGACGG CGTGGATTCC ACCCGCGGCA CGATCCAGGG CGACTGGCGC
GCCAGCGTGG TCCTGGGTTC GGGCCTGCGC GTTGAGCCGT TCGCCCAGGC GCGCGGCGAC
ACCTATCGGG TCAAGGACGT CTTTATCCCG GTCAACGCCT TCACCACGGG CGACACCCAC
AGCATCAACT CTTCGCGCGG CCTGGGCGTC GCCGGCGTCG ATCTGAGCCT GCCCATGTTC
AAGCCGCTGA AGAACGGCGG CAGCATCGTT CTGGAGCCGC TCGCTCAATT CGCCACCGGC
TCCAACAGTT CGCGGGTGCC GATCATCGTG GCCCGTGACG CGGCCGGAAA CCCGATCTAT
TTCAACGAAG ACAGCACCAA CTTCGAACTC GACGAAACCA ACCTGTTCGA CGTGAACAAG
TCGCCCGGCT TCGACCTCTA CGAAGGCGGC ACGCGCGTCA ATCTCGGCGG TCGCGCCACG
GTCAAGTTCG CTGACGGTCG AGGCGGCAGC GTACTGGTCG GCCGCAGCCT GCGCACCAAG
GTCGATCCGC TGATGCCGAC CCGCGCCGGT CTCGACCAGA AGGCTTCCGA CTGGATCGTC
GCGGCCACGG TCACACCGAT CCGCGGCGTC AACGCCTTCT CCCGCGCCCG TTTCGACAAC
GACACCGGCA AGCTCAACCG GATCGAAGCC GGCGTCGATG CGTCGGTTTC GCGGGGCTTC
GGTTCGCTCC GCTACCTGCG CGACAACAAG GACACCTCGG GCTTCCGCCA GGAAAACCTG
GACTTCTACG GCGACTACAA GATCCGCGAG CACTGGGGCG TGACCGCCCT GGGCCGCCTA
TCGTACCAGG ACGCCCGCGC CTTCGGCCTT CCAGCCGCCG ACAGCCAGTG GTCCTGGACC
CGTCGCGACC TGGGCGTCTA CTACAAGGAC GACTGCATCC GCATCGACGT GGTCTATCAG
AACGAGGACC GTTACACCCA GACGTCGAGC GGACTGAAAT TGAAGGCCGA CGAGTCCGTG
GTGCTGCGCC TAACGCTCGC CACATTAGGC GACACACTGT ACAGCAATTA G
 
Protein sequence
MMEFRWGAKD PKAVGMGRAV LLAGAAWLAL ACSAQAQQPL ATVPAAPTPS PAVDDGLGDT 
GYYLESDLLI RDDANQKMIA RGEVEARYQG RTLRADEVVY DSKTEVVTAH GHVQLINADG
TAQFADEMTM DKDMKAGFAR GFSARLDKNI KIAADTAVRR NEQITELNQA IYTPCEVCAE
KPKPTWSIQA DKVVQDKNRH LVYYHGATIR MWGAPLLYLP VFWHPDPQTQ RSSGFLTPKL
GVSKRRGVSY QQPYLFVFSP SQDLVLTPQI NAKVNPFLNA QYRKRFYSGA VDVRAGGTYD
KDFDNHGDRF GKGMFKSYIL ARGLFDIDQK WKWGFTAERA SQALIFDDYD ISDVYQQRGQ
FTADDHRLMS QIYTTRQDKR SYFSASMISV QGLRVVQVDP GTGLANRFEN SGAFPLIGPL
VEGRWEPESH ILGGRLRVQG SGVVLTRSES QFGEPPYAYA DYKGKDGVDS TRGTIQGDWR
ASVVLGSGLR VEPFAQARGD TYRVKDVFIP VNAFTTGDTH SINSSRGLGV AGVDLSLPMF
KPLKNGGSIV LEPLAQFATG SNSSRVPIIV ARDAAGNPIY FNEDSTNFEL DETNLFDVNK
SPGFDLYEGG TRVNLGGRAT VKFADGRGGS VLVGRSLRTK VDPLMPTRAG LDQKASDWIV
AATVTPIRGV NAFSRARFDN DTGKLNRIEA GVDASVSRGF GSLRYLRDNK DTSGFRQENL
DFYGDYKIRE HWGVTALGRL SYQDARAFGL PAADSQWSWT RRDLGVYYKD DCIRIDVVYQ
NEDRYTQTSS GLKLKADESV VLRLTLATLG DTLYSN