Gene Caul_3540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3540 
Symbol 
ID5900995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3816667 
End bp3818457 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content65% 
IMG OID641564047 
Productlong-chain-acyl-CoA synthetase 
Protein accessionYP_001685165 
Protein GI167647502 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.498772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTGC CGGCCAGGTT GAAGCGTGAA CTGCGGTTCC TGAAGGGGCT TTCGCGCACC 
CTGAAACGCG TGAAGTCCAT CGCGCCAGAC AGCACCAACC TGATCTGCGA CGACCTGGAA
GCCGCCGTCG ATAAGTGGCG CGACAGCCAG GCCATCGTCT TCGAAGGGCG CTCGGTCACC
TATGCCGAGT TGGACGCCAT CGCCAACCGC TACGCCCACT GGGCCAAGGG GCAGGGGATC
ACCCGCGGCC AGACCGTGGC GCTGTTCATG CCCAACAGGC TGGAATACGT GGCGATCTGG
TACGGACTGT CGAAGGTCGG GGTGGCCACG GCCCTGATCA ACAACCAGCT GACCGGCCCG
GCCCTGGCTC ACTGCCTGAA CATCTCCCAG GCCCTGCACT GCATCGTCGA TCCGGAGACC
TCGTCCTGTT TCGAGCAGGT GAAGGGATCG CTGGAGCGCC ACGTCCAGCA GTGGGTGTTG
GGTCCCGCCT ACGGCGACCA GCGCGATCTG GTCAACGCGC TCAAGAGCTG CAGCCAACTG
CGGCCGGATC GCCTGACGGC CCGTAACGGC CTGACCGCGC GCGACACCGC CCTCTATATT
TTCACCAGCG GCACCACGGG CCTGCCCAAG GCGGCGCGCA TCACTCACAT GCGGGCCCAG
CTCTACATGC GTGGTTTCGC CGGCTCGACC GATGCACGCC ACACCGACCG CATCTACATC
ACCCTGCCGC TCTATCACGC CACCGGCGGC CTGTGCGCGG TGGGCGCGGC CCTGCTGAAC
GGCGGAACAG TGGTGCTGCG CAAGAAGTTC TCGGTCAGCG CCTTCTGGGA CGACGTGGTC
GCCGAGAACT GCACGATGTT CGTCTATATC GGCGAGCTGT GCCGCTACCT GGCCAACCAC
CCGGAGGGTC CCAACGAGCG GGCGCACAAG ATCCGGCTGA TCTTCGGCAA CGGCCTGCGT
CCCGACGTCT GGGACGTGAT GCTCGACCGC TTCAAGGTCG GCGGGGTGCT GGAGTTCTAC
GGCGCGACCG AGGGCAATGT CTCGCTGTTC AACTTCGATG GCCGGCGCGG GGCCATCGGC
CGGGTGCCAG CCTATCTGAA GAAGAAGTTC AACATCCGCA TCGTCAAGTT CGACGTCGAG
ACCGAAACCC CGGTCCGGGC CGCCAACGGC TGCTGCATCG AGGCCGCGCC CGGCGAGATC
GGCGAGTGCA TCGGCCACAT CGCCAGCGAC GCCCGCTCCA ACTTCACCGG CTACGCCGAC
AAGGCCGCCA CCGAGAAGAA GATCCTGCAC GACGTCTTCG AGAAGGGCGA CGCCTGGTTC
CGCACCGGCG ACCTGATGCG CGCCGACAGC GACGGCTATC TCTATTTCAT CGACCGGATC
GGCGACACCT TCCGCTGGAA GGGCGAGAAC GTCGCCACCA GCGAGGTGTC CGAGCGCCTG
TCCGCGGTGC CCGGCGTCAA GGAGGTCAAT GTGTACGGCG TGCCAATCGG CGATCTGGAC
GGCAAGGCCG GCATGGCGGC GCTGGTGGTC GACGGCACGT TCGAGATCGC GGCCCTGGCC
GAATATGTCG ATCGCGAGCT GCCTGTCTAC GCCCGTCCGA TCTTCGTGCG CCTGCAGCCC
GAGATCGAGA CGACAGGCAC CTTCAAGTAT CGCAAGATCG ACCTGGTGAA GGAGGGTTTC
GATCCCGCCA ACACCCGGGA CCCATTGTAT TTCCGCGACC CGGCCAAGGG CTATGTGAAG
CTGACCAAGG CGGTCCACGC CAAGATCCTG GCGGGCGGCT ACAGGCTTTA G
 
Protein sequence
MSLPARLKRE LRFLKGLSRT LKRVKSIAPD STNLICDDLE AAVDKWRDSQ AIVFEGRSVT 
YAELDAIANR YAHWAKGQGI TRGQTVALFM PNRLEYVAIW YGLSKVGVAT ALINNQLTGP
ALAHCLNISQ ALHCIVDPET SSCFEQVKGS LERHVQQWVL GPAYGDQRDL VNALKSCSQL
RPDRLTARNG LTARDTALYI FTSGTTGLPK AARITHMRAQ LYMRGFAGST DARHTDRIYI
TLPLYHATGG LCAVGAALLN GGTVVLRKKF SVSAFWDDVV AENCTMFVYI GELCRYLANH
PEGPNERAHK IRLIFGNGLR PDVWDVMLDR FKVGGVLEFY GATEGNVSLF NFDGRRGAIG
RVPAYLKKKF NIRIVKFDVE TETPVRAANG CCIEAAPGEI GECIGHIASD ARSNFTGYAD
KAATEKKILH DVFEKGDAWF RTGDLMRADS DGYLYFIDRI GDTFRWKGEN VATSEVSERL
SAVPGVKEVN VYGVPIGDLD GKAGMAALVV DGTFEIAALA EYVDRELPVY ARPIFVRLQP
EIETTGTFKY RKIDLVKEGF DPANTRDPLY FRDPAKGYVK LTKAVHAKIL AGGYRL