Gene Caul_1347 details


Gene Information

Locus tag: Caul_1347
Symbol:
ID: 5898802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1430399
End bp: 1431937
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 67%
IMG OID: 641561834
Product: long-chain-fatty-acid--CoA ligase
Protein accession: YP_001682975
Protein GI: 167645312
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
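
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1430399, 1431937)
print(length)                     # 1539
print(protein_length_aa(length))  # 512
```

Both results match the Gene Length and Protein Length fields of this record.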


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCCTT TCATCCACGC CCAGACCCAA CCCGACAAGC CCGCCTACAT CATGGCCGGC 
TCCGGCGAGA CGGTGACCTA CGGCCAGCTG GACGCCCGCT CCAACCAGGG CGCCCAGCTG
TTCCGCTCGC TGGGCTTGAA GGCCGGCGAC GTGATCGCCA TCCTGATGGA CAACAGCCCG
CGGTTCTTCG AGATCGCCTG GGCGGCGCAG CGCGCGGGGC TCTACTACAC CTGCGTCTCG
ACCAAGCTGA CGCCGGCCGA GGTCGAGTAC ATCGTCAAGG ACTGCGGGGC CCAGGTGCTG
ATCGTCAGCC CGGCCTTGGA CGATGTCGCC CAAGCCGTCG CGCCGCTGAT CCCCGGCGTG
CGCCTGTTCC GGGTTGGCGG CGGCAAGGGC GCGTTCGAGG ACTTCGAGGC CGCGCGCGAC
GCCATGCCGG CCACGCCGAT CGCCGACGAG ACCTCGGGTT CGGACATGCT CTATTCCTCC
GGCACCACCG GCCGGCCCAA GGGGGTCAAG CCGGCGCTGA CCGGCGGGCC GATCGACGCG
CCCCACGCCC TGCAGATGAT GGCCATGGGC CTGTTCGGCT TCAGCGGCGA CAGTGTCTAC
CTGTCCCCCG CCCCGCTCTA TCACGCCGCG CCGCTGCGCT GGTGCATGAC CGTCCAGAAG
CTGGGCGGCA CGGTGATCGT GATGGAGAAG TTCGATCCCG AGGCGGCCTT GGCCCTGATC
GAGAAATACA AGGTGACTTG CGGCCAGTTC GTGCCCACCC ACTTCGTGCG GATGCTGAAA
CTGCCCGAGG CGGTTCGGGC CAAGTACGAC GTGTCGTCGA TCAAGTCCGC CGTCCACGCC
GCCGCCCCCT GCCCCGTGCC GGTCAAGGAA CAGATGATCG CCTGGTGGGG GCCGGTGATC
TTCGAATATT ACGCCGGCAC CGAGGGCAAT GGCTTCTGCT GGATCAATTC GCAGAACTGG
CTGACCCATA AGGGCAGCGT CGGCCAGGCG GTGCTGGGCG AACTGCGGAT CTGCGACGAG
GACGGCAATC CGGTTCCGCC GCGCACCGAG GGCACGGTCT ATTTCGCCAA CGGCCCCGCG
GTGAACTACC ATAACGCCCC CGACAAGACC GCCGAGAGCT ACAACCAGCA TGGCTGGACC
ACCCTGGGCG ACGTGGGCTG GGTCGACGAG GAGGGCTATC TCTACCTGAC CGACCGCAAG
AGCTTCATGA TCATCTCGGG TGGGGTGAAC ATCTACCCTC AGGAGATCGA GAACCTGCTG
ATCACCCACC CCAAGGTGGC CGACGCCGCC GTGGTCGGCG CCCCGCACGA GGAAATGGGC
GAGCAGGTGG TGGCGGTGAT CCAGCCGATG GACTGGGCCG AGGATCAGAC GGACCTGGCC
CAGGAACTGG CCGCCTTCTG CCGCGCCAAT CTCAGCCACG TGAAGTCGCC GCGCCGAATC
GACTTCATGC AGGAACTGCC CCGCCACGCG ACGGGCAAGC TCTACAAGCG GCTGATCCGG
GATGCGTACT GGGCGCAGGG CGAGAGCCGG ATCGGGTAG
 
Protein sequence
MHPFIHAQTQ PDKPAYIMAG SGETVTYGQL DARSNQGAQL FRSLGLKAGD VIAILMDNSP 
RFFEIAWAAQ RAGLYYTCVS TKLTPAEVEY IVKDCGAQVL IVSPALDDVA QAVAPLIPGV
RLFRVGGGKG AFEDFEAARD AMPATPIADE TSGSDMLYSS GTTGRPKGVK PALTGGPIDA
PHALQMMAMG LFGFSGDSVY LSPAPLYHAA PLRWCMTVQK LGGTVIVMEK FDPEAALALI
EKYKVTCGQF VPTHFVRMLK LPEAVRAKYD VSSIKSAVHA AAPCPVPVKE QMIAWWGPVI
FEYYAGTEGN GFCWINSQNW LTHKGSVGQA VLGELRICDE DGNPVPPRTE GTVYFANGPA
VNYHNAPDKT AESYNQHGWT TLGDVGWVDE EGYLYLTDRK SFMIISGGVN IYPQEIENLL
ITHPKVADAA VVGAPHEEMG EQVVAVIQPM DWAEDQTDLA QELAAFCRAN LSHVKSPRRI
DFMQELPRHA TGKLYKRLIR DAYWAQGESR IG
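
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a minimal sketch, not the method used to produce this record: the helper names are hypothetical, the codon table is the standard amino-acid assignment (which translation table 11 shares), and the alternative start codons that table 11 permits are not handled, since this gene begins with ATG. Whitespace is stripped so the sequences can be pasted in the grouped layout shown here.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCACCCTT TCATCCACGC CCAGACCCAA"))  # MHPFIHAQTQ
print(translate("ATCGGGTAG"))                         # IG
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content field listed in this record.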