Gene Caul_3975 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3975 
Symbol 
ID5901437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4305420 
End bp4307012 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content70% 
IMG OID641564496 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_001685598 
Protein GI167647935 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.251646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.301159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAG CCGCCGTCGA TTTCGATCGC ATGACGACGT TGGGCGACGT CGCCCGCTAC 
CACCGGCAGG TCCGGCCCGA AGCCACGGCC CTGGTGTTCG AGGGGCGGGC GACCAGCTTC
GCGGACTTCG ACCGCAACAC CGACAGGGTG GCCGCCGCCC TGCTGGCCGA GGGCCTGACC
AAGGGCGACC GCATCGCCTA TGTCGGCAAG AACAGCGACC ACTATTTCGA GCTGCTGTTC
GGCGCCGCCA AGGCCGGGGT GGTGCTGGCC CCGATCGGCT GGCGCCTGGC CCCGCGCGAG
ATCGCCTACA TCCTGGGCGA CGCCGAGGCG CGCATGGTGT TCGTCGGTCC GGAAATGATC
GCCCACGTCC GTGACGTGGC CGAATTGATC CTGGACCAGC CGACGCTGGT CGCCATGGAG
CCCAACGACT ACGGCCACCC GGAATTCATG CCCTGGCGCG ACGCCGCGCC CGAGGATGGC
AAGCCCGCCC ACGTAACCTC GGCCGACATC GCCGTCCAAC TGTACACCTC GGGCACCACC
GGCCGGCCCA AGGGCGCGAT GCTGACCCAC GCCAACATCC TGGGGCCGCG CAGGCTGGCC
GCCGCCGCCG ACATGGCCTG GAACCGCTGG GGGCCGGACG ATGTCAGCCT GGTGGCCATG
CCCGTGGCCC ATATCGGCGG CACCGGCTGG GGCGTGGTCG GGCTGGTCAA CGGCGCCAAG
GGCGTGGTGG CCCGCGAGTT CGACCCGACC AAGGTGCTGG ACTTCATCGA GCGCGACCGG
GTCTCCAAGA TGTTCATGGT GCCCGCCGCC CTGCAGATCG TCGTGCGCCT GCCGCGCGCT
CGCCAGGTCG ACTACAGCCG CCTGACCCAC ATCCTCTACG GCGCGGCCCC CATTCCGCTG
GACCTGCTGC GCGAGTGTCT CGAGGTGTTC GGCTGCGGCT TCGTCCAGCA GTACGGCATG
ACCGAGACGA CCGGCACGGT GGTCTATCTG CCGCCCGAGG ACCACGACCC GGCCGGCAAC
CCCCGCATGC GCTCGGCCGG CCTGCCCATG CCCGGCGTCG AGCTGAGGAT TCTCGGCGAG
GACGGCCGGG TCCTGCCGCC GGGCGAGGTC GGCGAGGTGG CGGTCCGCTC GCCCGCCAAC
ATGGCCGGCT ACTGGAAGCT GCCCGAGGCG ACGGCCGACA CTATCGATTC CGATAGCTGG
TTGCGCACCG GCGACGCCGG CTACATCGAC GCGGACGGCT ACCTGTTCAT CCACGATCGC
GTGAAGGACA TGATCATCAG CGGCGGCGAG AACATCTATC CGGCCGAGGT GGAGAGCGCC
GTCTATGGCC ACCCGCACGT GGCCGAGGTG GCGGTGATCG GGGTGCCCGA CGACACCTGG
GGCGAGGCGG TCAAGGCGGT GGTCGCCCTC AAGCCCGGCG CGCCGCGCGA TCCGGCCGAC
ATCATCGCCT TCTCCCGCAC CCGCATCGCC GGCTTCAAGG CCCCCAAGAC CATCGACTTC
GTCGAGGCTT TGCCGCGCAA CGCCTCGGGC AAGATCCTGC GCCGCGAGCT GCGCGAGCCC
TACTGGGCGG GCAAGACGCG ACGGGTGAAC TAG
 
Protein sequence
MSAAAVDFDR MTTLGDVARY HRQVRPEATA LVFEGRATSF ADFDRNTDRV AAALLAEGLT 
KGDRIAYVGK NSDHYFELLF GAAKAGVVLA PIGWRLAPRE IAYILGDAEA RMVFVGPEMI
AHVRDVAELI LDQPTLVAME PNDYGHPEFM PWRDAAPEDG KPAHVTSADI AVQLYTSGTT
GRPKGAMLTH ANILGPRRLA AAADMAWNRW GPDDVSLVAM PVAHIGGTGW GVVGLVNGAK
GVVAREFDPT KVLDFIERDR VSKMFMVPAA LQIVVRLPRA RQVDYSRLTH ILYGAAPIPL
DLLRECLEVF GCGFVQQYGM TETTGTVVYL PPEDHDPAGN PRMRSAGLPM PGVELRILGE
DGRVLPPGEV GEVAVRSPAN MAGYWKLPEA TADTIDSDSW LRTGDAGYID ADGYLFIHDR
VKDMIISGGE NIYPAEVESA VYGHPHVAEV AVIGVPDDTW GEAVKAVVAL KPGAPRDPAD
IIAFSRTRIA GFKAPKTIDF VEALPRNASG KILRRELREP YWAGKTRRVN