Gene Caul_1315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1315 
Symbol 
ID5898770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1389485 
End bp1392382 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content70% 
IMG OID641561800 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001682943 
Protein GI167645280 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.689219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGGGT TGATGCAACA CGGGGCCTTG ACGGTCGACA AGATCATCGA CCATGCGGCC 
CAGTGGCACG GCGGCCGCGA AGTCGTCACC CGCTCGGTCG AAGGCCCGAT CGTCCGCACC
ACCTATGGCG CGATCCGCGA CCGCGCCAAG CGAGTGTCAA ACCTCCTGCT GGCCTTGGGG
ATCAAGCCGG GCGACCGGGT GGGAACCCTG GCCTGGAACA CCGCCCGCCA CATGGAGGCC
TGGTACGGGA TCATGGGCAT GGGGGCGGTG TGCCACACCC TGAACCCCCG CCTGTTCCCC
GAGCAGATCG CCTGGATCGC CAACCACGCT GGCGACCGCG TGATCTTCAC CGACCTGACC
TTCCTGCCGA TCGTCGCCGG GATCCTGCAC CACCTGCCCG CCGTCGAGCA CGTGGTGCTG
TTCACCGACC GCGGCCACAT GCCCGCCGGC TTCACGCCGG CCGGCGAGGC GCCGAACTTC
AAGGGCTTGC TTTGCTACGA GGACCTGGTC GAGCAGCACC CCGCCGACTG CGCCTGGGGC
GGCTTCGACG AGGGTACGGC GGCTGGACTC TGCTACACCT CGGGCACGAC GGGCGACCCC
AAGGGCGTGC TCTATTCGCA CCGCTCCAAC GTGCTGCACA CCCTGATCAC CCTGCAGCCG
GACGTGATGG GCCTGTCGCA GCGCGACGTG ATCTTGCCGG TGGTGCCGAT GTTCCACGCC
AACGCCTGGG GCGTGGCGTT CTCGGCGCCC GGTACCGGCG CCAAGATGGT CATGCCCGGT
GGCAAGATGG ACGGGGCGTC GATCTATGAG TTGCTCGATA GCGAAGGCGT CACCTTCTCG
GCCGCCGTGC CCACCGTCTG GCAGATGCTG CTGCAGTATC TGAAGGAAAG CGGCGCCAAG
CTGCCGGTGC TGAAGAAGGT GGTGATCGGC GGGGCGGCCT GTCCTGAGGT CATCATCCGC
GCCTTCCAGG AGGATTACGA CGTCGAGGTG GTCCACGCCT GGGGCATGAC CGAGACCTCG
CCGGTCGGCG CCCTGTCGGT GATGACCGAC GAACTGGCCA AGCTGCCCTA TGACCAGCAG
ATGCCCTATC GGCTCAAGCA GGGCCGGCCG CCGTTCGGTG TCGAGCTGAA GTTGACCAAC
GACTTGGGCG AGCGCCTGCC GCACGACGGC AAGAGCTTTG GCAACCTGAA GATCCGCGGT
CCGATCATCG TCGCCGAATA CTTCCGCGGC GCTGGCGGCA AGATCCTCGA CGACGAGGGC
TTCTTCGACA CCGGCGACGT GGCCACGATC GACGAGCACG GCTTCATGCA GATCACCGAC
CGGGCCAAGG ACGTGGTCAA GTCGGGCGGC GAGTGGATCA GCACGATCGA CATCGAGAAC
ATCGCCCTGG GTCATCCCAA GGCGGCGATG ACGGCGGTGA TCGGCGTGCC TCATCCCAAA
TGGGACGAGC GACCGATCCT GCTGGTCAAG CTGAACGACG GCGAGACCGC CACCAAGGAA
GAGTTCCTGG AGTTCCTGCG GGGCAAGATC GCCAAGTGGT GGATGCCCGA CGACGTGATC
TTCGTCGACG AGATCCCATT GGGCGCGACG GGCAAGGTCG ACAAGAAGCT GATCCGCCAG
CGGATGCAGG GCTATGTGCT GCCGGGTCTG GCCGCCGAGG CGGTCGCCGA GGTCGCGACC
GAGGCCCAGT CCGAAGGCGA GTCCGCGGTC CAGTCCGAGC CGCCGGCCCC GACCCTGGCC
GCCGCCGACG GCGCCCAGGC GGCGCGCATC TACGCGCCCG AGCCGGAGGA CCCGCTGTCG
GAACCGCCGC CGCTGCTCGA CCCCCAGGTC GTCGAGGCCT TGGCCGAGGC GGAAGCCGAA
TCCGAGGCCG CGCCGGAACC GGAGGCCGAA CCGCTGGTCG CGACCGCCGC CCCCGGGGCC
GAGCCGGTCT TCGAGTCCCG CGCCACGGAG GAGCATTTCC CGCTTGGCCC GCTTCCCCCG
GCGACGGAGG CCGAGCTGGC GCCGGTTCCC GAGGAGGCCT TCCACGCCAA GCCGGTCTTC
GTCGCCGAGG AGGCGCCCCT GGTCATGCCG TTGGTTCCGC CGAGGGCGCG CCGCAAGGGC
GGAAAGTCCA AGGTCGGCGG CAAGCCCGGC GGCCTGACGG CGGGCTTGCT TGACCTGGCG
ATCCTGGTCG CCCTGGCGCC CGCCCTGCTG GTCGCGGCCG GCGCGCTGGG CGTGAAGTTC
GGCCTGTTCC CGCTGGCGGT CGGTTATGAC CAGATGACCC TGGACTGGGC GCCCAAGGCG
GCGCTGGCCA GCGCCGCCAC CGGGGTCCTG GGCCTGATCG CGGCCCTGTT CGGCGGCTTC
TCGCGATACT GGGCCAAGGC CCTGCTGGCC CTGGCGATCA CCGTCGTCAC GATCGGCGCC
ATGGTGGCGG CCAACGCCCT GGGCGGCCGC GCGCCGCCGA TCCACGACGT CTCGACCGAC
TGGAAGACGC CGCTGATGCT GTCGGACGCC GGCCTGGCGG CGCGCGGCGA CCAGGCCCAG
ACCGTCGAGG AGGATCCCAG CCTGCCGGTC GGCTCGCTGG CCTTCGCCGG CCGGCGGATC
GCCGACGTCA ACGCCGAGAC CTGTCCGGCC GCCCGGCCGC TGGTGCTGGC GCGCTCGCCG
GCCGACGCCT ACGAGTCGGC CAAGGCCGCC GTTCAGGCCG CCGGTCTGTC GATCGTCACC
GACGACCCGA TGGACGGCCG GCTGGAGGCC ACGGGCCAGA GCTTCTGGTA CGGGCTGAAG
GACGACCTGA TCGTGCGGGT GCGCCCCGAC ACGGCCGGCG CCCGGGTCGA CATGCGCTCG
GTCGGCCGCA CCGCCGGCGC CGACATGGGC CGCAACTGCC GCCGGGTGGG CGGGCTGCTG
GCGGTGGTGA AGGGGTAG
 
Protein sequence
MQGLMQHGAL TVDKIIDHAA QWHGGREVVT RSVEGPIVRT TYGAIRDRAK RVSNLLLALG 
IKPGDRVGTL AWNTARHMEA WYGIMGMGAV CHTLNPRLFP EQIAWIANHA GDRVIFTDLT
FLPIVAGILH HLPAVEHVVL FTDRGHMPAG FTPAGEAPNF KGLLCYEDLV EQHPADCAWG
GFDEGTAAGL CYTSGTTGDP KGVLYSHRSN VLHTLITLQP DVMGLSQRDV ILPVVPMFHA
NAWGVAFSAP GTGAKMVMPG GKMDGASIYE LLDSEGVTFS AAVPTVWQML LQYLKESGAK
LPVLKKVVIG GAACPEVIIR AFQEDYDVEV VHAWGMTETS PVGALSVMTD ELAKLPYDQQ
MPYRLKQGRP PFGVELKLTN DLGERLPHDG KSFGNLKIRG PIIVAEYFRG AGGKILDDEG
FFDTGDVATI DEHGFMQITD RAKDVVKSGG EWISTIDIEN IALGHPKAAM TAVIGVPHPK
WDERPILLVK LNDGETATKE EFLEFLRGKI AKWWMPDDVI FVDEIPLGAT GKVDKKLIRQ
RMQGYVLPGL AAEAVAEVAT EAQSEGESAV QSEPPAPTLA AADGAQAARI YAPEPEDPLS
EPPPLLDPQV VEALAEAEAE SEAAPEPEAE PLVATAAPGA EPVFESRATE EHFPLGPLPP
ATEAELAPVP EEAFHAKPVF VAEEAPLVMP LVPPRARRKG GKSKVGGKPG GLTAGLLDLA
ILVALAPALL VAAGALGVKF GLFPLAVGYD QMTLDWAPKA ALASAATGVL GLIAALFGGF
SRYWAKALLA LAITVVTIGA MVAANALGGR APPIHDVSTD WKTPLMLSDA GLAARGDQAQ
TVEEDPSLPV GSLAFAGRRI ADVNAETCPA ARPLVLARSP ADAYESAKAA VQAAGLSIVT
DDPMDGRLEA TGQSFWYGLK DDLIVRVRPD TAGARVDMRS VGRTAGADMG RNCRRVGGLL
AVVKG