Gene Caul_4340 details


Gene Information

Locus tag: Caul_4340
Symbol:
ID: 5901801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4717292
End bp: 4719154
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 70%
IMG OID: 641564858
Product: acyl-CoA synthetase
Protein accession: YP_001685958
Protein GI: 167648295
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
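
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4717292
end_bp = 4719154

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Every three bases form one codon; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1863 bp
print(protein_length, "aa")  # 620 aa
```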


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGCG CCGAAGCCCT GACGGCCGCG CGGGACCCCA GCAGCGCGCC CTTCAAGCCC 
CTGCCGATGA AGGGCGCCGA CATCAGCGTC GAGCGGCGGG CCGACGGCAG CATCGTCATC
ACCTCCAACC ATCCGCCCGG CGACGGGCCG CGCACCATCA GCCACCTGCT GGCCGAGAAG
GCTGCCGCCC ATCCCGACCG GCCGTATCTC AATCAGCGCG AGCCCGGCCA CGGCCCGTGG
CGCGGCGTCA CCTACGGCCA GGCCCAGCGG GCGGTCGAGG GGATCGCCCA GTGGCTGCTG
GACCAGGGCG TCACCGCCGA CGACAGCGTG ATGATCCTGT CGGGCAATTC CATCGAGCAC
GCCCTGATGA CGCTGGGGGC CTACGGGGCC GGGGTCCCGG CCGCGCCGGT CAGCCCGGCC
TACAGCCTGA TGTCCACCGA CCACGGCAAG CTCAAGCACT GTTTCGACAC GGTCGCGCCG
CGGGTGGTGT TCGCCCAGTC CGGCGTGCTG TTCGACAAGG CCATCGCCGC CTTGCGAGCG
ATCAAGCCCG ACCTGCTGCT GGTCACCGCC GACGGAACTG GCGAGGGCGC GATCGCCTTC
GACACGGTCG CCGCCACCGT GCCCCGCGAA GTCGCCGCCA AGCGCGAGAG CCTGACCCCG
GCCACCGTCG CCAAGTACCT GTTCACCTCC GGCTCGACCG GCGTCCCCAA GGGCGTGCCC
CAGACCCATG CGATGATGGC GGGCGTGATC GCCGGCCAGG AGGGCCTGCG CACCGACGAA
CCCACCGACG AGATCCCCAA CAGCCTGGAA TGGATGCCCT GGAGCCACAT CTCAGCTGGC
AATATCGGCT TCAACGGCGT GCTGTGGTCC GGTGGCACGC TGTGGATCGA CGAGGGCAAA
CCGCTGCCCG GCATGTTCGA GACCACGATC AAGAACCTCT ATGAGGTCTC GCCCGTGGTG
TTCGGCTCGG CCCCGATCGC GTTTTCGATG CTGGCGGGGG CGATGGAGAA CGATCCGGTC
CTGCGCGCGG CGTTCTTCAA GAACCTCAAA TATATGGGCT ATGGCGGGGC CACCCTGTCG
GACGACGTCT ATACCCGCAT GCAGGCCCTG GCCGTGGCCG AGACCGGCCA TCGCGTGCCC
CTGACCACCA TGTATGGCGC CACCGAGACC CAGGGGGTCA CCGTGGTCCA CTGGATCACC
GAGCGGGTCG GGCTGGTCGG GCTGCCGCTG CCGGGCATCG CCCTGAAACT GGCCCCGGCC
GGCGCCAAGT ACGAGGTGCG GGTCAAGGGT CCGACCGTGG CGGCGGGCTA TCACAAGGAC
CCCGAGAAGA CCGCGGCGGC GTTCGACGAG GAAGGCTTCT ACAAGCTGGG CGACGCGGCG
CGGTTCGTGG ATCCGAAAGA TCCGTCCAAG GGCCTGGTGT TCGACGGCCG GGTGACCGAG
GACTTCAAGC TCGACAGCGG CACATGGGTC AGCGTCGGGG TGCTGCGCCC CGACCTGGTG
GCCGCCTGCA GCCCCTACAT CCACGACGCG GTGATCGCCG GCCAGGACAA GCCGTTCGCC
GCCGCCCTGG TCTGGCCATC GCCGGCCGGG CTGGCGGCCC TGGTCGCGGA CCAGGGACCG
GGCACGCCGC TGGACAAGCT GACCGCGATC CTGCGCGAGC GGATCGGGGC GTTCAACGCC
CGGGCCGGCG GCTCGTCGCG CTGCATCCGC CGGGTGGCCA TCCTGACCGA ACCGCCGTCC
ATCGACGCCG GCGAGATCAC CGACAAGGGC TATGTGAACC AGCGCGCGAC GCTGGAGCGC
CGGGCGGCTG TCGTGGCGGG GCTGTTCGCG GAGCCGCCGG GGGAGGGCGT GATCGTGATC
TGA
 
Protein sequence
MDGAEALTAA RDPSSAPFKP LPMKGADISV ERRADGSIVI TSNHPPGDGP RTISHLLAEK 
AAAHPDRPYL NQREPGHGPW RGVTYGQAQR AVEGIAQWLL DQGVTADDSV MILSGNSIEH
ALMTLGAYGA GVPAAPVSPA YSLMSTDHGK LKHCFDTVAP RVVFAQSGVL FDKAIAALRA
IKPDLLLVTA DGTGEGAIAF DTVAATVPRE VAAKRESLTP ATVAKYLFTS GSTGVPKGVP
QTHAMMAGVI AGQEGLRTDE PTDEIPNSLE WMPWSHISAG NIGFNGVLWS GGTLWIDEGK
PLPGMFETTI KNLYEVSPVV FGSAPIAFSM LAGAMENDPV LRAAFFKNLK YMGYGGATLS
DDVYTRMQAL AVAETGHRVP LTTMYGATET QGVTVVHWIT ERVGLVGLPL PGIALKLAPA
GAKYEVRVKG PTVAAGYHKD PEKTAAAFDE EGFYKLGDAA RFVDPKDPSK GLVFDGRVTE
DFKLDSGTWV SVGVLRPDLV AACSPYIHDA VIAGQDKPFA AALVWPSPAG LAALVADQGP
GTPLDKLTAI LRERIGAFNA RAGGSSRCIR RVAILTEPPS IDAGEITDKG YVNQRATLER
RAAVVAGLFA EPPGEGVIVI
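
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the standard amino-acid assignments (translation table 11 uses the same assignments as the standard code and differs only in its permitted start codons), and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino-acid assignments in NCBI's TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()   # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3     # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First three 10-base blocks of the gene sequence above.
print(translate("ATGGACGGCG CCGAAGCCCT GACGGCCGCG"))  # MDGAEALTAA
```

Pasting the full gene sequence into `translate` and removing the trailing `*` gives a string that can be compared with the protein sequence once its spaces are removed.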