Gene Caul_2951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2951 
Symbol 
ID5900406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3201864 
End bp3203597 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content72% 
IMG OID641563448 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001684576 
Protein GI167646913 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.982174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCGC CGGGCCAGGA CACGCAACCG CCGGCCACCT TGGTCGCGGG CATGCTGGCG 
GCCGCCGCCG CCTATCCGGA GAACGGTTTC ACCTTCCAGG ACGCGGCGGG CCGCGAGACC
TTCTACAGCT TCCCCGACCT GCTGCTGGCC ACCGAGCGCG CCGCCGCGGG CCTGCAAAGC
CTGGGCCTGG GCCACGGCGA CCGGATCGCC CTGCTGACCC AGGATCCCGA GGAGTTCATC
ATCGCGTTCC TCGGGGCGGT GCGGGCCGGG ATCGCGCCCG CCCCGCTCTA TCCGCCGCCG
CCGCTGGGCG GCATAGAGAT CTATCTGAGC CAGACCGTCG CCCTGCTCGA TGTGGCGCGC
CCCGCCGCCC TGATCGGCTC GGCCAAGGTG CTGGGCGACA TCCAGGCCGC CGTCGCCGGC
CTGGACGGCG TCAAGGCCGT CGCCACCGTG CAGGAGATCC GCGCCTGCCA GGCGCCGATG
ACGCCCTGCG AGGTCGGCCC GGACGACGTG GTCTTCCTGC AGTTCACCTC GGGCTCGACC
AGCACGCCGC GCGGCGTGAT CGTCACCCAC CGCGCCCTGG TCGCCAATAT CGCCTGCTTC
ATGGACCAGT CGCTGCAAGC CGATCCGGCC CGCGACAAGG GCGTCACCTG GCTGCCGCTC
TATCACGACA TGGGGCTGAT CGGCTTCGTG CTGGGACCCG TCCATACTGG CGTCTCGGTG
GTGTTCATGC CGACCGTGCG GTTCGCCAAG TCGCCGGCCG CCTGGCTGGA CGCCCTGCAC
CAGCATCGCG GCACCATCAC CTTCGCCCCC AACTTCGCCT TCGCCCTGCT GCTGCGCCGG
CTGCGGGCCG AGGATCTGGG GCGCTGGGAC CTGTCCTGCG TCAAGGCCCT GGGCTGCGGG
GCCGAGCCGA TCCACCCCGA CCTGATCGAG CGCTTCCTCG ACGTCTTCGC CGCCGCCGGG
CTGAGCCGCG ACGCCTTCCT GCCCGCCTAC GGCCTGGCCG AGGCCACCCT GGCCGTGGCC
CTGCGCCGGC TGGGCGCGCC GGTCAGCACC CAGCGGGTTG ACCGCGAGAC CTTCGAGCGC
ACCGGCGTCT CCACGCCCGC GCGGGAAGAC CGTTCCTGGC TCGATCATGT CGGGTTCGGC
GGCCCCTTCG CGGGACATGA AATCGCTATC CGCGACCCCG ACGGCGCGGC CCTGCCCCAT
GGCCGCGAAG GCGACATCTG GCTGCACGGC CCCTCGGTCT GCGCCGGCTA TCTGGGCGAC
GAGGCCGGCT GGAACGCTAT CTGCCGGGAC GGGTGGCTCA ACACCGGCGA CCGGGGCTAT
CTGGCGGACG GCGAGCTGTT CGTGTCCGGA CGGTCCAAGG AACTGATCAT CGTCAACGGC
CGCAACATCC ATCCCCAGCC GTTGGAATGG GCGGTCAGCG CGCTGTCGGG CGTGCGGCCT
CAATGCGTCG CGGCCTTCGC CGTGCCGTCC CTGACCACCG AGGCCATCGT CATCGCCCTG
GAAGCCAAGG GCCGGCCGAC GACCGATCTG GTGGCCGCCG TCGAGGACGC GGTCGAGGAC
CTGGTCGCCT GCCGGCCGCT CGACGTCGTC CTGCTGCCGT CCGGCTCGCT GTCGCGCACC
ACCTCCGGCA AGCTCAAGCG CGGCCACGTG CGGCGGCGGT ATCTGGACGG CGACCTGCCC
AGATTGGAGC CGACGCCCGT CTCCATGCCC GTTGGGGAGG CAGGCCAACC GTGA
 
Protein sequence
MNPPGQDTQP PATLVAGMLA AAAAYPENGF TFQDAAGRET FYSFPDLLLA TERAAAGLQS 
LGLGHGDRIA LLTQDPEEFI IAFLGAVRAG IAPAPLYPPP PLGGIEIYLS QTVALLDVAR
PAALIGSAKV LGDIQAAVAG LDGVKAVATV QEIRACQAPM TPCEVGPDDV VFLQFTSGST
STPRGVIVTH RALVANIACF MDQSLQADPA RDKGVTWLPL YHDMGLIGFV LGPVHTGVSV
VFMPTVRFAK SPAAWLDALH QHRGTITFAP NFAFALLLRR LRAEDLGRWD LSCVKALGCG
AEPIHPDLIE RFLDVFAAAG LSRDAFLPAY GLAEATLAVA LRRLGAPVST QRVDRETFER
TGVSTPARED RSWLDHVGFG GPFAGHEIAI RDPDGAALPH GREGDIWLHG PSVCAGYLGD
EAGWNAICRD GWLNTGDRGY LADGELFVSG RSKELIIVNG RNIHPQPLEW AVSALSGVRP
QCVAAFAVPS LTTEAIVIAL EAKGRPTTDL VAAVEDAVED LVACRPLDVV LLPSGSLSRT
TSGKLKRGHV RRRYLDGDLP RLEPTPVSMP VGEAGQP