Gene Caul_1304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1304 
Symbol 
ID5898759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1379308 
End bp1381239 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content72% 
IMG OID641561789 
Productprotein of unknown function DUF303 acetylesterase putative 
Protein accessionYP_001682932 
Protein GI167645269 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.222608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.170288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTCGA TGATCCTTGG CCTGCTGGCC GCCTCCGCTC TCGCCACGGC GGTTCAAGCC 
GCGCCCCTCC TGGCTCCGGT CTTCACCGAC CACGCCGTGC TGCAACGCGG CCAGCCGATC
CGGGTCTGGG GCGGGGCGGG GCCGGGCGAG GCGGTGACCG TGACGCTGGG CGAGGCCGAG
GTTAGGGCGA CCGCCGACGC CCAGGGCCGC TGGATGACGA CTTTCCCGGC CCGCGAGCCG
GGCGCGGCCC TGACCCTGAC CGCCCAGGCT GGCGGCGAGC GCCAGACGAT CTCGGACCTG
CTGGTCGGCG ACGTGTGGCT GTGCTCGGGC CAGTCGAACA TGGAATATCC GCTGCGCCGG
GCCCTGGGCG GCGAAGCCGA GGCGGCCAAT TCGGCCGACC CGAACATCCG CCTGCTGCAG
ACCGGCCGCA TCAGCCTGCC CACGCCGACG ACCGCCCTGC CCCAGGAGGC GGTCTGGCGC
GCGGCCACCC CGGAATCGGC CAACAACTTC AGCGCCGCCT GCTTCTTCAT GGGCCGCGAC
ATCAAGAAGA CCACCGGCGT CCCGGTCGGC CTGATCGACG CCACCTGGGG CGGCTCGATC
ATCCAGGACT GGATCAGTCG CGAGGGCCTG CGCGCCCTGG GCGGCTACGA CGAAGGCTTG
GAAGTGCTGG CCGACTACGC CAAGTCGCCG GACGTCGGCG TGGCCCACTG GCGCGCCATG
CTCGATCGCT GGGCGGCCAA GATCGAGCCA CAGGCGACCG CCTGGAGCCG CCCGGACTTC
GACGACCATG ACTGGAAGAC CATGCCGGCC GAACGGTTCT GGGAGACCAA TCCAGGCCTG
GAGACCTTTG ACGGAACGAT CTGGCTGCGG GCTGTCATCA CCCTGACCGC CCAGCAGGCC
AAGCAGGGCG CGACCCTGTC GCTGGGGCCG ATCGACGACC TGGACACCAC CTTCGTCAAC
GGCCGCGAGG TGGGCGCGTC GCAGGGCTGG AACACGCCGC GCGCGTACAA GATCGCGCCG
GGCGTGCTGA AAGCGGGCCC GAACGTGATC GCCGTGCGGG CCATCGACAC CAGCGGCGGC
GGCGGCGCCT GGGGGCCGGC CGCCGAAAAG GGGCTGAAGC TGGACGACGG CGCGTTCGTC
CCGCTGGGCG GAGCCTGGCG CTACAAGGTC TCGGCGCCCA TCGCCCAGAG CGGCCTGCCG
CCGACGGCCC CTTGGGTGGG GACCAGCGGC CTGTCGACCC TGCATAACGG CATGATCGCC
CCGCTCGCGC CCTACGGCCT GAAGGGCTTC GCCTGGTACC AGGGCGAGGC CAATGTCGCC
GAACCGGCCG AATATGCCCG CGCCCTGCCG GCCCTGATCG CCGACTGGCG CCGGGTGTTC
GGCGGACCCG ACCTGCCGTT TCTGGTGGTG CAACTTGCCG CGTTCGGGCC GCGTCAGGTC
AAGCCGGCCG AATCCGGCTG GGCCGGAATC CGCGACGTCG AGCGCCGCAC GGCCGCGGCG
GATCCCAAGG TCGGCCTCGC CTCGGCCGTC GATGTCGGCG ACATCTACGA CATCCATCCA
GCCAACAAGC AGCAGGTCGG GCTGCGCCTG GCGCTGCAGG CCCGCAAGCT GGCCTATGGC
GAGACTGCCC TGGTCGCCGC CGGTCCGGCC CCGCTGTCGG CGACGCGGGC CGCGAACGCG
GTGGTGGTGC GGCTGGACCA GCCAGCGGTG GTGCAGGCTG ACGCCCGGCC GGTCGGCTTC
GAGCTGTGCG ACGCGGACGG CGCCTGCCGG TTCGCCGACG CGGCCCTGGC GGGCGACCGG
ATCAGCGTGG TGGTTCCGAC CGGCTTCGCG CCGGTCAAGG TCCGCTTCGC CTGGGCCGAC
AGTCCGATTC TCAACCTTTA TGGCCAGACC GGCCTGCCGG CGACGCCGTT CGAATTGGCG
ATCACGCCCT AG
 
Protein sequence
MRSMILGLLA ASALATAVQA APLLAPVFTD HAVLQRGQPI RVWGGAGPGE AVTVTLGEAE 
VRATADAQGR WMTTFPAREP GAALTLTAQA GGERQTISDL LVGDVWLCSG QSNMEYPLRR
ALGGEAEAAN SADPNIRLLQ TGRISLPTPT TALPQEAVWR AATPESANNF SAACFFMGRD
IKKTTGVPVG LIDATWGGSI IQDWISREGL RALGGYDEGL EVLADYAKSP DVGVAHWRAM
LDRWAAKIEP QATAWSRPDF DDHDWKTMPA ERFWETNPGL ETFDGTIWLR AVITLTAQQA
KQGATLSLGP IDDLDTTFVN GREVGASQGW NTPRAYKIAP GVLKAGPNVI AVRAIDTSGG
GGAWGPAAEK GLKLDDGAFV PLGGAWRYKV SAPIAQSGLP PTAPWVGTSG LSTLHNGMIA
PLAPYGLKGF AWYQGEANVA EPAEYARALP ALIADWRRVF GGPDLPFLVV QLAAFGPRQV
KPAESGWAGI RDVERRTAAA DPKVGLASAV DVGDIYDIHP ANKQQVGLRL ALQARKLAYG
ETALVAAGPA PLSATRAANA VVVRLDQPAV VQADARPVGF ELCDADGACR FADAALAGDR
ISVVVPTGFA PVKVRFAWAD SPILNLYGQT GLPATPFELA ITP